Small issue/question...

Thalpha
Thalpha
Joined: 11 Jun 06
Posts: 4
Credit: 2675
RAC: 0
Topic 191396

Hello all,

I have a question regarding the Einstein project within the BOINC manager I run. Each time it is resumed it restarts from 0%, no matter where it was suspended (when I turned off the PC for example or manually suspended it).
I also get the following message: "Task XXXXXX exited with zero status but no 'finished' file. If this happens repeatedly you may need to reset the project."
What can I do about this? How do I restart the project?
Will this solve the problem?

Pooh Bear 27
Pooh Bear 27
Joined: 20 Mar 05
Posts: 1376
Credit: 20312671
RAC: 0

Small issue/question...

Does it start from zero, but then in a few minutes pop back to a larger percentage? You have returned a few results, and they look good. The exit with 0 status is kind of a weird error, and is usually not a problem.

If you want to reset it, I would suggest you set the option "No New Work" when it finishes, do a final update, then detach and reattach. You will probably get a new ID, but this may help.

ersatzjim
ersatzjim
Joined: 9 Dec 05
Posts: 117
Credit: 3982042
RAC: 0

I recently came to understand

I recently came to understand that the restart points are set by the "Write to disk at most every" setting under "Your Account" / "General Preferences".

I had thought they were written in the code somewhere - maybe they are - I seem to recall a conversation about this sometime in Dec.

I may be off base here, but I think the info came from Eric Korpela, one of the top scientist cats over at SETI, in the last few days.

Jim

Those who don’t build must burn. It’s as old as history and juvenile delinquents.
Ray Bradbury - Fahrenheit 451

Thalpha
Thalpha
Joined: 11 Jun 06
Posts: 4
Credit: 2675
RAC: 0

me again... For e.g. a

me again...

For e.g. a task is at 40% after 3hours. I turn off the PC and when I turn it on it restarts the task from 0% although the CPU time is 3h. It does not jump forward after a while...it keeps on going from scratch.

: If I change the settings for "Write to disk at most every" option will it have any effect on this? It is currently set to 60 sec. What value should it be to solve the issue...

10q

Michael Karlinsky
Michael Karlinsky
Joined: 22 Jan 05
Posts: 888
Credit: 23502182
RAC: 0

Seems that albert is unable

Seems that albert is unable to read the checkpoint after it exits with
"no heartbeat from core client" error message. The second time the checkpoint
is read successfully. The "no heartbeat" message can usually be ignored, maybe not in your case.

Try to reset the project after the current WU is finished and reported.

HTH

Michael

5.4.9

2006-06-14 23:01:28.0781 [normal]: Start of BOINC application 'projects/einstein.phys.uwm.edu/albert_4.37_windows_intelx86.exe'.
2006-06-14 23:01:28.0781 [normal]: Started search at lalDebugLevel = 0
2006-06-14 23:01:29.5468 [normal]: Checkpoint-file 'Fstat.out.ckp' not found.
2006-06-14 23:01:29.5468 [normal]: No usable checkpoint found, starting from beginning.
2006-06-14 23:09:21.8906 [normal]: Fstat file reached MaxFileSizeKB ==> compactifying ... done.

2006-06-15 20:54:39.8125 [normal]: Start of BOINC application 'projects/einstein.phys.uwm.edu/albert_4.37_windows_intelx86.exe'.
2006-06-15 20:54:39.8281 [normal]: Started search at lalDebugLevel = 0
2006-06-15 20:54:41.3437 [normal]: Found checkpoint-file 'Fstat.out.ckp'
2006-06-15 20:54:41.3906 [normal]: Trying to read Fstat-file into toplist ...
2006-06-15 20:54:44.5468 [normal]: Checksum Ok. Successfully read_toplist_from_fp()
2006-06-15 20:54:44.5468 [normal]: Resuming computation at (23268/109964945/2207927).
No heartbeat from core client for 31 sec - exiting

2006-06-16 17:22:07.5625 [normal]: Start of BOINC application 'projects/einstein.phys.uwm.edu/albert_4.37_windows_intelx86.exe'.
2006-06-16 17:22:07.5625 [normal]: Started search at lalDebugLevel = 0
2006-06-16 17:22:09.0312 [normal]: Found checkpoint-file 'Fstat.out.ckp'
Failed to read checkpoint-counters from 'Fstat.out.ckp'!
2006-06-16 17:22:09.0312 [normal]: No usable checkpoint found, starting from beginning.
2006-06-16 17:51:33.8437 [normal]: Fstat file reached MaxFileSizeKB ==> compactifying ... done.

2006-06-16 18:47:55.8281 [normal]: Start of BOINC application 'projects/einstein.phys.uwm.edu/albert_4.37_windows_intelx86.exe'.
2006-06-16 18:47:55.8281 [normal]: Started search at lalDebugLevel = 0
2006-06-16 18:47:57.2187 [normal]: Found checkpoint-file 'Fstat.out.ckp'
2006-06-16 18:47:57.2187 [normal]: Trying to read Fstat-file into toplist ...
2006-06-16 18:48:00.2187 [normal]: Checksum Ok. Successfully read_toplist_from_fp()
2006-06-16 18:48:00.2187 [normal]: Resuming computation at (23042/109577076/2200170).
2006-06-16 23:32:26.0937 [normal]: Search finished successfully.

Thalpha
Thalpha
Joined: 11 Jun 06
Posts: 4
Credit: 2675
RAC: 0

10x... I will try this...the

10x...
I will try this...the problem is that now it's not only Einstein...
SETI is doing the same thing...:(

Pooh Bear 27
Pooh Bear 27
Joined: 20 Mar 05
Posts: 1376
Credit: 20312671
RAC: 0

RE: 10x... I will try

Message 38071 in response to message 38070

Quote:
10x...
I will try this...the problem is that now it's not only Einstein...
SETI is doing the same thing...:(


I have a few questions, suggestions:

What Anti-Virus software are you running? If Norton, or I think there was another, have it NOT scan the BOINC folder and subdirectories.

Have you done a disk check or defrag lately, or do either of these automatically run? Projects can not be running when these happen, it causes bad things to happen.

What are all the projects you are running? Just Seti and Einstein? If so, set both to no new work, allow them to finish, and do a final update to make sure everything gets reported correctly. Then uninstall, and reinstall.

Thalpha
Thalpha
Joined: 11 Jun 06
Posts: 4
Credit: 2675
RAC: 0

My antivirus is

My antivirus is BitDefender...and I did not scan lately...nor defrag.
I run Einstein, SETI and Rosetta.

I will try to reinstall...hope it works this way :)

Comment viewing options

Select your preferred way to display the comments and click "Save settings" to activate your changes.