Download speeds slow for BRP data |
Message boards : Cruncher's Corner : Download speeds slow for BRP data
| Author | Message |
|---|---|
|
I noticed today that the BRP data downloads were very slow and even going into backoff. Is it that E@H has so much GPU power at its disposal that the network is unable to keep up? | |
| ID: 113982 | | |
|
Hi | |
| ID: 114007 | | |
|
Unfortunately while I have a decent DSL speed it has download limits, so I tend to use the off-peak period for a couple of the machines. The off-peak window is only 6 hours, so download speed effects those machines as they try and replenish their 1 day cache. | |
| ID: 114016 | | |
Hi It would be interesting to learn a little bit more about the WU generation process and the stages involved. In particular, it feels to me as if it's a batch process, with blocks of WUs being released in clumps, rather than a steady flow. I'm also seeing great variability in download speeds. Sometimes no progress is possible at all, sometimes they crawl down at 5 or 10 KB/sec, and sometimes they come through at up to 100 KB/sec. It led me to wonder whether the intermediate datafile storage server had difficulty coping with both high-speed data insertion and high-speed downloading at the same time. | |
| ID: 114017 | | |
|
For the first time in days I had now normal download speeds. Last week it was arround 8-15kB/s. | |
| ID: 114019 | | |
|
I think we are really staying near a moment when users machines will be able to get the whole beam at once by downloading large data file with raw telescope data, crunch it from the beginnig to the end completely (including pre-processing and post-processing), and return back to server all the data in a post processed state. There are a lot of modern users machines that are able to do this job already today. They alread have rather good internet connection and are able to work with huge databases right in big enough PCs memory. So, it will be great if we will have an opportunity to crunch BIG WUs on sufficient machines. BOINC already can return machine capacity information to server so it can decide whether it is possible for that machine to do BIG job at once. | |
| ID: 114098 | | |
|
I'm going into backoff for nearly all of my downloads (doesn't matter what kind of job (CPU only)). DSL, constant internet presence. (5 machines are showing this activity) | |
| ID: 114249 | | |
|
I'm seeing five different behaviours: | |
| ID: 114250 | | |
|
Is there a possibility of using a different compression algorithm for BRP4? Would using LZMA2 for example reduce the size of each task some? I am not sure what the existing compression algorithm is. With each task being 32MB and GPUs being able to process these tasks in as little as 20-minutes, the transfer requirements become significant. This is even more so the case when multiple GPUs are running. | |
| ID: 114254 | | |
Is there a possibility of using a different compression algorithm for BRP4? Would using LZMA2 for example reduce the size of each task some? I am not sure what the existing compression algorithm is. With each task being 32MB and GPUs being able to process these tasks in as little as 20-minutes, the transfer requirements become significant. This is even more so the case when multiple GPUs are running. well, there are several projects which use 7z http://www.7-zip.org/ to compress data. | |
| ID: 114255 | | |
Is there a possibility of using a different compression algorithm for BRP4? Would using LZMA2 for example reduce the size of each task some? I am not sure what the existing compression algorithm is. With each task being 32MB and GPUs being able to process these tasks in as little as 20-minutes, the transfer requirements become significant. This is even more so the case when multiple GPUs are running. and applying 7-zip to a random data file reduces its size from 4,098 KB to 3,979 KB - about 3% compression. I do think the project might have thought of that one, if it was going to be any significant use ;-) | |
| ID: 114256 | | |
Is there a possibility of using a different compression algorithm for BRP4? Would using LZMA2 for example reduce the size of each task some? I am not sure what the existing compression algorithm is. With each task being 32MB and GPUs being able to process these tasks in as little as 20-minutes, the transfer requirements become significant. This is even more so the case when multiple GPUs are running. yup - it depends on the kind of data. at least worth a test.. | |
| ID: 114257 | | |
Is there a possibility of using a different compression algorithm for BRP4? Would using LZMA2 for example reduce the size of each task some? I am not sure what the existing compression algorithm is. With each task being 32MB and GPUs being able to process these tasks in as little as 20-minutes, the transfer requirements become significant. This is even more so the case when multiple GPUs are running. Well. I did the test. BRP4 can be compressed to 91% of original size according to WinRAR. They are partly text files and partly data files with lots of zeroes in there. So it is the moment to do something with data. Not with the network bandwidth. | |
| ID: 114260 | | |
Well. I did the test. BRP4 can be compressed to 91% of original size according to WinRAR. They are partly text files and partly data files with lots of zeroes in there. So it is the moment to do something with data. Not with the network bandwidth. alexander's winrar probably is not an option because it's payware. 7z is open-source and to my experience often performs better.. | |
| ID: 114270 | | |
Well. I did the test. BRP4 can be compressed to 91% of original size according to WinRAR. They are partly text files and partly data files with lots of zeroes in there. So it is the moment to do something with data. Not with the network bandwidth. I think there will be no difference in compression between both WinRAR and 7z. | |
| ID: 114274 | | |
Well. I did the test. BRP4 can be compressed to 91% of original size according to WinRAR. They are partly text files and partly data files with lots of zeroes in there. So it is the moment to do something with data. Not with the network bandwidth. Both WinRAR and 7z can handle multiple compression formats. It isn't the tool that you use that matters, it's the compression algorithm that you choose which has to be one which balances compression ratios with server CPU load. Remember too that the servers run Linux, so whatever tool is chosen, it won't be one with "Win..." in its name ;-) | |
| ID: 114275 | | |
|
Hi! | |
| ID: 114277 | | |
|
I tried a few different compression options with one of the 4MB files: | |
| ID: 114280 | | |
The data rates here are prodigious - BRP4 requires roughly four times as much data per unit of computation time, as even the fastest 'shorty' SETI work. And with SETI being largely down at the moment, a lot of that data demand will have transferred here. Guilty as charged. Steve ____________ Crunching as member of The GPU Users Group team. ![]() | |
| ID: 114283 | | |
|
I think we should try. If it will overload servers alittle, than it is no trouble to get everything back to normal. Am I right? | |
| ID: 114297 | | |
|
Downloadspeeds are very low @ the time. Was away for some hours and get only 7 WUs downloaded, the over 20 others are still loading O.o Had to primegrid one WU before the first einstein WU was ready to crunch. So the packing is not the real clue that seems to help out of speed problems ^^ | |
| ID: 114300 | | |
|
I did suggest a change to the way BOINC handles backoffs. It seems Dr A has put it into the too hard basket. If the projects start asking for it then he may look at doing it. Date: Sat, 01 Oct 2011 10:37:23 -0700 ____________ BOINC blog | |
| ID: 114301 | | |
|
It's getting to the point now where I'm running out of CPU work on a couple of machines because I can't get work downloaded. Some of that work has a due date that I may not be able to meet if it isn't downloaded soon :( | |
| ID: 114306 | | |
|
looks like the logjam has broken. what, did SETI come back online? | |
| ID: 114359 | | |
|
There seems to be a new problem with downloading BRP4-files. | |
| ID: 114365 | | |
looks like the logjam has broken. what, did SETI come back online? Seti is still having all kinds of issues. Work only trickles out. It has had little affect on the computers that don't produce a great deal of work, but has crippled the top hosts. Steve ____________ Crunching as member of The GPU Users Group team. ![]() | |
| ID: 114366 | | |
|
It seems Dr A has had a rethink after a few more emails on the mailing list and is going to look into this. I did suggest a change to the way BOINC handles backoffs. It seems Dr A has put it into the too hard basket. If the projects start asking for it then he may look at doing it. | |
| ID: 114409 | | |
|
Haven't noticed DownLoad trouble or no D'Load, at all. | |
| ID: 114455 | | |
|
Is there any other BOINC project that uses more than a single server for download? I would imagine that Einstein@home is really a special case in that respect. | |
| ID: 114477 | | |
Is there any other BOINC project that uses more than a single server for download? I would imagine that Einstein@home is really a special case in that respect. CPDN (Climate Prediction) uses multiple download servers, but they've taken a different approach. The client is only given one url, but for redirection or redundancy it can be of the form <url>http://climateapps2.oucs.ox.ac.uk/cpdnboinc/download/mirror.php?file=/hadam3p_6.14_windows_intelx86.exe</url> - I would guess the decision as to exactly which machine handles the load is made by the php script running on the server. SETI@home use a single download url, but round-robin DNS to distribute the load across two servers. When they need to distribute vast numbers of identical files, for example during an application roll-out, they used some form of transparent redirection to a proxy cache server, but some ISPs choked on that and prevented some clients getting the files. I guess having multiple servers only really helps for files which can easily (and efficiently) be made available in more than one place at once - applications, and the locality data files that only Einstein uses. But the problem of the client stalling on files required for one application class, preventing the download of files for a different application within the same project, could be more general than just Einstein. | |
| ID: 114480 | | |
Message boards :
Cruncher's Corner :
Download speeds slow for BRP data