Searching for pulsars in PALFA data from Arecibo |
Message boards : Problems and Bug Reports : Searching for pulsars in PALFA data from Arecibo
| Author | Message |
|---|---|
|
We are starting some limited public testing of a new pulsar search on Einstein@Home. This search uses data from the PALFA collaboration, taken at the Arecibo radio observatory. Science information will be available in this thread in the Science Message Board area. | |
| ID: 92812 | | |
|
How do we recognize these tasks? Do they have a different name or different application? | |
| ID: 92813 | | |
|
Will they download if running the optimized apps, or do we have to remove the app_info and run standard? | |
| ID: 92814 | | |
|
If we want to take part, can we please have instructions on how to do so. | |
| ID: 92818 | | |
|
There is no need for you to do anything, the tasks will be issued randomly between the usual "HierarchicalSearch" tasks. The number of tasks will be very limited during the testing phase (628 workunits at a time), it is rather unlikely you get one. The name of the application will be "einsteinbinary_ABP1" instead of the current "einstein_S5R4". You will be able to opt-out from the "Arecibo binary pulsar search" in your Einstein@home preferences (once the App is in the database, which will be some time tomorrow afternoon CET). Users currently running the Windows Beta App will not automatically get ABP1 tasks, I'll publish instructions and a new app_info.xml in the Beta App thread in the next days. For various technical reasons the ABP1 App will not be available for Mac OS X on PowerPC. | |
| ID: 92828 | | |
|
It would be good idea to put info at server-status about how many WU is present in the system. | |
| ID: 92890 | | |
It would be good idea to put info at server-status about how many WU is present in the system. There sure will be once this goes into production, but with the test set of just a few hundred units, it will be more like now-you-see-them,now-you-don't. CU Bikeman ____________ ![]() ![]() | |
| ID: 92891 | | |
|
The first 628 "ABP1" workunits are being produced right now and should be delivered in about 1h. | |
| ID: 92909 | | |
It would be good idea to put info at server-status about how many WU is present in the system. That's a good idea. We'll modify the server status page to show this. ____________ | |
| ID: 92915 | | |
It would be good idea to put info at server-status about how many WU is present in the system. Thank you. | |
| ID: 92925 | | |
|
Two issues are (now) known for this application: | |
| ID: 92948 | | |
|
The server that serves the ABP1 workunit data files shut itself down for an emergency at about 19:30 CET. We are investigating, it will probably not be back up before tomorow. ABP1 workunits downloaded now will probably end up as download errors (nope, they are not served by the mirror network). | |
| ID: 93050 | | |
|
The ABP1 validator stumbled over a result file that apparently forced it into an infinite loop. The program has been diabled until the problem has been fixed. No credit for ABP1 workunits in the next days. | |
| ID: 93122 | | |
|
The ABP1 validator has been fixed and is running again. | |
| ID: 93203 | | |
|
Now that ABP1 has been officially released and new workunits have been created, could we have the validator running again, please? | |
| ID: 95816 | | |
Now that ABP1 has been officially released and new workunits have been created, could we have the validator running again, please? The handling of ABP1 results will be moved to a different machine, for some time the server status page won't be able to show the daemon status correctly. Also, I note that the credit 'claim' passed back to the servers is the old benchmark*time figure. I presume that the eventual 'grant' will be a server-determined value Yes. BM | |
| ID: 95867 | | |
|
Hello, | |
| ID: 95991 | | |
Hi, this problem has been solved. All uploads should now work flawlessly. Cheers, Oliver | |
| ID: 96036 | | |
|
Windows XP , BOINC 6.2.19 | |
| ID: 96552 | | |
Windows XP , BOINC 6.2.19 Yeah, it's unfortunate that "distributed computing" has been adopted for rather malicious applications (bot-nets etc) so malware-scanners are now detecting what is normal busineess in BOINC as a suspicious thing. I'm sure there are ways to tell that scanner that the PALFA app (and the S5R5 app) are trusted by you, right? CU Bikeman ____________ ![]() ![]() | |
| ID: 96555 | | |
Windows XP , BOINC 6.2.19 How stupid is that? templates_400Hz.bank is a plain ASCII file! BM | |
| ID: 96561 | | |
Windows XP , BOINC 6.2.19 I could not find the PALFA or S5R5 apps but instead I set Firewall Rules for boinc boinccmd and boincmgr. Don't know if this was the right thing to do ? Although the download did not complete immediately, after a delay it did. It has not been necessary to set any rules in the past. Normally GDATA does not report that it is downloading files unless GDATA itself is being updated or it is updating virus signatures. Thanks for your suggestion. RJ | |
| ID: 96562 | | |
|
I guess the mere fact that an application is establishing an HTTP connection to the Internet is the reason for the alert, not the content (so it's more a personal firewall thing, not a virus scanner, I guess). You'd have to tell it that BOINC (not the apps as I indicated earlier, as BOINC is doeing the actual downloading) can be trusted. | |
| ID: 96563 | | |
|
I just noticed that two iMacs running OS 10.4 have failed on every ABP1 task. I installed the latest BOINC client, but that did not help. I have not had problems with machines running 10.5. Does this application require 10.5? | |
| ID: 96686 | | |
I just noticed that two iMacs running OS 10.4 have failed on every ABP1 task. In lieu of the lack of input from anyone that may be able to answer your query, I suggest disabling the "Arecibo Binary Pulsar Search" on your account. Perhaps at a later time, the issue will be answered and resolved. ____________ | |
| ID: 96756 | | |
I just noticed that two iMacs running OS 10.4 have failed on every ABP1 task. Will do. Thanks. ____________ | |
| ID: 96758 | | |
I just noticed that two iMacs running OS 10.4 have failed on every ABP1 task. You're welcome, also I neglected to mention before that you may see occasional messages (sometimes a few in a row) stating that there is no work for the 'Hierarchical' searches and thus no work available. Just ignore those and eventually (I haven't ran out of work because of this but I also keep a 3 to 5 day cache of work units...) E@H will send non-ABP1 work. ____________ | |
| ID: 96760 | | |
I just noticed that two iMacs running OS 10.4 have failed on every ABP1 task. Are these Intel or PowerPC iMacs? BM | |
| ID: 96992 | | |
I just noticed that two iMacs running OS 10.4 have failed on every ABP1 task. Intel | |
| ID: 97024 | | |
|
Strange WU 55643247 | |
| ID: 98430 | | |
|
WU 135113304 Why? | |
| ID: 98433 | | |
|
For the validate errors, see this thread. | |
| ID: 98435 | | |
|
Hi folks! | |
| ID: 99538 | | |
|
On the SETI boards it says that the ALFA receiver is down for maintenance until November. This might be the reason you are not getting ABP1 units. | |
| ID: 99539 | | |
On the SETI boards it says that the ALFA receiver is down for maintenance until November. This might be the reason you are not getting ABP1 units. I'm getting the units. It's just that I'm not getting the application. ____________ ![]() | |
| ID: 99541 | | |
On the SETI boards it says that the ALFA receiver is down for maintenance until November. This might be the reason you are not getting ABP1 units. How are your computing preferences in Einstein? Did you allow all applications? ____________ | |
| ID: 99543 | | |
On the SETI boards it says that the ALFA receiver is down for maintenance until November. This might be the reason you are not getting ABP1 units. Einstein has hundreds of hours of data in stock. Per day they use about 20 minutes worth, so they'll easily survive the outage at Arecibo. ____________ Jord -The BOINC FAQ Service -CUDA/CAL Stream FAQ | |
| ID: 99544 | | |
On the SETI boards it says that the ALFA receiver is down for maintenance until November. This might be the reason you are not getting ABP1 units. I haven't changed anything, other than deleting the APB beta apps and the app_info.xml file. ____________ ![]() | |
| ID: 99545 | | |
On the SETI boards it says that the ALFA receiver is down for maintenance until November. This might be the reason you are not getting ABP1 units. Did you try restarting BOINC? ____________ | |
| ID: 99546 | | |
On the SETI boards it says that the ALFA receiver is down for maintenance until November. This might be the reason you are not getting ABP1 units. Yes. The problem isn't on my end. ____________ ![]() | |
| ID: 99552 | | |
... I haven't changed anything, other than deleting the APB beta apps and the app_info.xml file. I haven't been paying proper attention lately - I wasn't even aware that the test app had been made 'official'. What I normally do is run the cache dry and report, stop BOINC, delete app_info.xml and delete the reference to the test app that is inserted into the state file. I don't delete the test app itself if the same app has become 'official'. When BOINC restarts with no reference to the test app in the state file, it realises that it has to go to the project for the official app, but it also realises that it already has a copy of the official app so it skips the download and just continues to use the test app as the official app - since nothing has really changed. If you still have a copy of the test app, maybe this technique would 'work around' the problem of not being able to download (for whatever reason) the new app. just wondering out loud :-). ____________ Cheers, Gary. | |
| ID: 99576 | | |
|
Resetting the project, after deleting the app_info.xml file & restarting BOINC, will in the least make sure the correct application version is downloaded again. | |
| ID: 99577 | | |
Resetting the project, after deleting the app_info.xml file & restarting BOINC, will in the least make sure the correct application version is downloaded again. I was just going to suggest the same thing! In fact, I'm surprised that Gary's method works, because in general Beta apps are unsigned, but BOINC requires a signature for the production apps. | |
| ID: 99578 | | |
In fact, I'm surprised that Gary's method works, because in general Beta apps are unsigned, but BOINC requires a signature for the production apps. That would explain this:-) 19/09/2009 15:41:20|Einstein@Home|[error] Application file einstein_S5R5_3.05_windows_intelx86.exe missing signature I only had deleted the app_info.xml file. However, there are no problems since, even without a reset.19/09/2009 15:41:20|Einstein@Home|[error] BOINC cannot accept this file Gruß, Gundolf ____________ Computer sind nicht alles im Leben. (Kleiner Scherz) ![]() | |
| ID: 99580 | | |
Resetting the project, after deleting the app_info.xml file & restarting BOINC, will in the least make sure the correct application version is downloaded again. True - but I've always had an aversion to resetting projects if there is another way ... :-). Just deleting the app_info.xml file will leave information in the client_state.xml file that an anonymous platform is being used. Which was why I said to "delete the reference to the test app that is inserted into the state file." For someone with a single machine and no experience with playing around in the state file, resetting is obviously a far less risky way to go. ____________ Cheers, Gary. | |
| ID: 99584 | | |
... I'm surprised that Gary's method works, because in general Beta apps are unsigned, but BOINC requires a signature for the production apps. Not only does the method work but cutting and pasting the signature from another machine running the official app works like a charm as well. I tried cutting and pasting first but then discovered I didn't even need to do that as long as I removed the AP stuff from the state file. The state file ended up with the signature inserted automatically - I guess as a result of the scheduler exchange that ended with the "file exists - skipping download" message. It's a while since I last did that so I'm a bit hazy on the details but I'm sure that I was able to do this without even running the cache dry. I just examined the differences between two state files, one running under AP and the other running the same app 'officially'. It was pretty obvious what needed to be deleted. I used to pre-prepare the signature as a small block to be inserted and then do each machine sequentially. It took around a minute per machine to go from crunching under AP to crunching the same tasks with the same app - but 'officially'. These days I don't even bother inserting the signature - I just delete the AP stuff. I don't know if anything has changed with more recent BOINCs that would invalidate the procedure. Another thing I've found I can do by playing around in the state file is completely recover trashed caches. On quad core machines, I quite often have a cache of 50-100 tasks or more. My internet plan with my ISP has a 6GB monthly limit with a 60GB off-peak free allowance. I run a cron job twice a day on one machine that uses boinccmd to stop network access on all machines just before the start of the peak period and then to allow access just after the start of the off-peak period. That way all uploading and downloading is done in the free period. So with this regimen in place and with the peak period being during the day and evening, I tend to keep an eye out for problems during that time. If something goes wrong and a cache gets trashed, nothing gets reported to the project and an easy recovery is usually possible. As an example, I had an overheating problem on a machine with a dodgy CPU fan. Fortunately it happened and was discovered during peak hours when comms were disabled. The machine didn't crash - it just trashed the entire cache :-). After replacing the fan, I was able to restart the machine and edit the state file to remove all the trashed results, leaving the small number of results that had been completed without error prior to the fan going south. When I restarted BOINC and re-enabled network access, the server promptly sent a flood of missing results and accepted the upload of the good results that had been saved. All I lost was the time that had been spent on the 4 current tasks in flight, up to the point that the temperature became too hot. I would have done at least 10 full cache recoveries like this (for various reasons) over the last year or two. As yet, I haven't had a single recovery fail. Gotta love that 'resend lost results' BOINC feature :-). ____________ Cheers, Gary. | |
| ID: 99586 | | |
Message boards :
Problems and Bug Reports :
Searching for pulsars in PALFA data from Arecibo