WU stuck folding forever

This forum is for discussions about the Motherboards.org Folding team. What is folding? Venture on in for a look.

Moderator: The Mod Squad

WU stuck folding forever

Postby Myth » Tue Jan 15, 2008 8:56 pm

This WU was folding just fine until yesterday. I have it backup every 15 minutes. was seeing a frame get finished every 4 timed backup. Check this one out. I shut off folding and restarted this morning:


--- Opening Log file [January 15 15:31:23]


# Windows Console Edition #####################################################
###############################################################################

Folding@Home Client Version 5.02

http://folding.stanford.edu

###############################################################################
###############################################################################

Launch directory: D:\Folding1
Executable: D:\Folding1\FAH502-Console.exe
Arguments: -local -verbosity 9 -forceasm -advmethods

Warning:
By using the -forceasm flag, you are overriding
safeguards in the program. If you did not intend to
do this, please restart the program without -forceasm.
If work units are not completing fully (and particularly
if your machine is overclocked), then please discontinue
use of the flag.

[15:31:23] - Ask before connecting: No
[15:31:23] - User name: Myth42 (Team 33258)
[15:31:23] - User ID: 84E3AC33A45DFDD
[15:31:23] - Machine ID: 1
[15:31:23]
[15:31:23] Loaded queue successfully.
[15:31:23] + Benchmarking ...
[15:31:27] The benchmark result is 4720
[15:31:27]
[15:31:27] + Processing work unit
[15:31:27] Core required: FahCore_79.exe
[15:31:27] Core found.
[15:31:27] Working on Unit 03 [January 15 15:31:27]
[15:31:27] + Working ...
[15:31:27] - Calling 'FahCore_79.exe -dir work/ -suffix 03 -priority 96 -checkpoint 15 -forceasm -verbose -lifeline 1936 -version 502'

[15:31:27] - Autosending finished units...
[15:31:27] Trying to send all finished work units
[15:31:27] + No unsent completed units remaining.
[15:31:27] - Autosend completed
[15:31:47]
[15:31:47] *------------------------------*
[15:31:47] Folding@Home Double Gromacs Core
[15:31:47] Version 1.91 (April 11, 2006)
[15:31:47]
[15:31:47] Preparing to commence simulation
[15:31:47] - Assembly optimizations manually forced on.
[15:31:47] - Not checking prior termination.
[16:13:42] - Expanded 4900136 -> 34412398 (decompressed 702.2 percent)
[16:19:27]
[16:19:27] Project: 3908 (Run 2241, Clone 4, Gen 0)
[16:19:27]
[16:26:02] Assembly optimizations on if available.
[16:26:02] Entering M.D.
[16:32:07] (Starting from checkpoint)
[16:32:17] Protein: IBX in water
[16:32:17]
[16:32:27] Writing local files
[16:33:07] Completed 12264 out of 25000 steps (49)
[17:26:55] Timered checkpoint triggered.
[17:52:38] Timered checkpoint triggered.
[18:17:14] Timered checkpoint triggered.
[18:49:44] Timered checkpoint triggered.
[19:15:45] Timered checkpoint triggered.
[19:41:44] Timered checkpoint triggered.
[20:07:56] Timered checkpoint triggered.
[20:33:57] Timered checkpoint triggered.
[21:06:13] Timered checkpoint triggered.
[21:30:13] - Autosending finished units...
[21:30:13] Trying to send all finished work units
[21:30:13] + No unsent completed units remaining.
[21:30:13] - Autosend completed
[21:34:33] Timered checkpoint triggered.
[22:00:29] Timered checkpoint triggered.
[22:25:08] Timered checkpoint triggered.
[22:50:50] Timered checkpoint triggered.
[23:23:30] Timered checkpoint triggered.
[23:49:10] Timered checkpoint triggered.
[00:14:44] Timered checkpoint triggered.
[00:40:30] Timered checkpoint triggered.
[01:06:21] Timered checkpoint triggered.
[01:38:42] Timered checkpoint triggered.
[02:03:34] Timered checkpoint triggered.
[02:29:28] Timered checkpoint triggered.
[02:55:20] Timered checkpoint triggered.
[03:21:04] Timered checkpoint triggered.
[03:27:46] - Autosending finished units...
[03:27:46] Trying to send all finished work units
[03:27:46] + No unsent completed units remaining.
[03:27:46] - Autosend completed
[03:53:24] Timered checkpoint triggered.
[04:19:13] Timered checkpoint triggered.


Has anyone seen anything like this? Suddenly taking forever when it was working just fine.

I have noticed that my machine is being a bit sluggish to do regular things (open a browser window, open or close a program ... etc). Everything was working just fine until this frame started to render.

Any recomendations?
Image
Image
Myth
Black Belt 3rd Degree
Black Belt 3rd Degree
 
Posts: 3094
Joined: Fri Aug 29, 2003 1:13 pm
Location: Limboland

Postby Myth » Wed Jan 16, 2008 12:10 am

OK.... I rebooted the machine and all seems to be back to normal. Don't know why it freaked out like that without anything provoking it.
Image
Image
Myth
Black Belt 3rd Degree
Black Belt 3rd Degree
 
Posts: 3094
Joined: Fri Aug 29, 2003 1:13 pm
Location: Limboland

Postby Pette Broad » Wed Jan 16, 2008 3:54 am

I've seen it once or twice, rebooting as you've already done seems to cure it. One thing you have to be watch is that if the WU is not making progress then it throws up an error and deletes the unit. I'm not sure after how long it does this but you look to have caught it in time :)

Pete
Image
Pette Broad
Black Belt 5th Degree
Black Belt 5th Degree
 
Posts: 5490
Joined: Tue Jul 10, 2001 12:01 am
Location: Flintshire, U.K

Postby Karlsweldt » Wed Jan 16, 2008 7:39 am

Likely a stray bit of malware or spybot that came from a site visit. May be time to ensure the system is sanitary. Even if the unit isn't used for browsing, a port attack may find a vulnerable entry for viral or malware pestilence.
You may be over-cautious about forced checkpoints. Just setting the checkpoint references for restarts should be sufficient. Have not had a WU failure in quite a while due to checkpoint failure.
F@H.. to solve mankind's maladies.. in our lifetimes!
Karlsweldt
Mobo-fu Master
Mobo-fu Master
 
Posts: 20659
Joined: Wed Nov 12, 2003 11:57 am
Location: 07438


Return to Motherboards.org Folding Team

Who is online

Users browsing this forum: No registered users and 1 guest