Author |
Message |
|
I don't receive any new WU due the following error message(s) :
15/05/2009 23:09:12 GPUGRID Requesting new tasks
15/05/2009 23:09:17 GPUGRID Scheduler request completed: got 0 new tasks
15/05/2009 23:09:17 GPUGRID Message from server: No work sent
15/05/2009 23:09:17 GPUGRID Message from server: Full-atom molecular dynamics on Cell processor is not available for your type of computer.
For one or the other bizarre reason, the server thinks My computer is a PS3 ?
During BOINC initiation, I get the following messages :
GenuineIntel Intel(R) Core(TM)2 Quad CPU @ 2.40GHz [x86 Family 6 Model 15 Stepping 7]
Processor features: fpu tsc pae nx sse sse2 mmx
OS: Microsoft Windows XP: Professional x86 Edition, Service Pack 3, (05.01.2600.00)
Memory: 2.00 GB physical, 3.85 GB virtual
Disk: 465.76 GB total, 71.29 GB free
Local time is UTC +2 hours
CUDA device: GeForce 8800 GT (driver version 18585, compute capability 1.1, 512MB, est. 73GFLOPS)
Computer ID: 28542
What's going on here ?
Upgraded Boinc, Display and Cuda drivers to the latest and greatest to resolve never ending WU issues. Partially works; they do error out from time to time ...
Crunched for about 2 months without any problems (apart from the easter server crash)till last week...
|
|
|
|
After a reboot and 2 or three failed request, I received a WU.
So problems are solved, but I am a bit puzzled (that's not that hard...) about the error messages... |
|
|
|
So problems are solved, but I am a bit puzzled (that's not that hard...) about the error messages...
It think it's still the standard message when no GPU-WUs are available but the server still has PS3-Grid WUs. However, this message has also been sent out in other cases of errors.
MrS
____________
Scanning for our furry friends since Jan 2002 |
|
|
|
It is also related to the 'debt' parameter see this thread:
http://www.gpugrid.net/forum_thread.php?id=1070#10371
____________
Join team Bletchley Park, the innovators. |
|
|
Bymark Send message
Joined: 23 Feb 09 Posts: 30 Credit: 5,897,921 RAC: 0 Level
Scientific publications
|
Moore bad WU's:
http://www.gpugrid.net/workunit.php?wuid=616590
____________
"Silakka"
Hello from Turku > Åbo. |
|
|
Rob.BSend message
Joined: 18 Apr 09 Posts: 3 Credit: 2,542,440 RAC: 0 Level
Scientific publications
|
All the GPUGRID WU's I download fail as follows, no issues until a few days ago.
Any ideas?
10/08/2009 17:54:47 GPUGRID Starting 403-GIANNI_DOPc-4-25-RND8363_0
10/08/2009 17:54:47 GPUGRID Starting task 403-GIANNI_DOPc-4-25-RND8363_0 using acemd version 667
10/08/2009 17:54:56 GPUGRID Computation for task 403-GIANNI_DOPc-4-25-RND8363_0 finished
10/08/2009 17:54:56 GPUGRID Output file 403-GIANNI_DOPc-4-25-RND8363_0_1 for task 403-GIANNI_DOPc-4-25-RND8363_0 absent
10/08/2009 17:54:56 GPUGRID Output file 403-GIANNI_DOPc-4-25-RND8363_0_2 for task 403-GIANNI_DOPc-4-25-RND8363_0 absent
10/08/2009 17:54:56 GPUGRID Output file 403-GIANNI_DOPc-4-25-RND8363_0_3 for task 403-GIANNI_DOPc-4-25-RND8363_0 absent
10/08/2009 17:54:56 SETI@home Restarting task 22dc08ad.4376.103450.8.10.247_0 using setiathome_enhanced version 608
10/08/2009 17:54:59 GPUGRID Started upload of 403-GIANNI_DOPc-4-25-RND8363_0_0
10/08/2009 17:55:03 GPUGRID Finished upload of 403-GIANNI_DOPc-4-25-RND8363_0_0
10/08/2009 17:55:05 Milkyway@home Sending scheduler request: To fetch work.
10/08/2009 17:55:05 Milkyway@home Requesting new tasks
10/08/2009 17:55:10 Milkyway@home Scheduler request completed: got 0 new tasks
10/08/2009 17:55:10 Milkyway@home Message from server: No work available
10/08/2009 17:55:22 GPUGRID Finished download of 129-GIANNI_DOPa-9-pdb_file
10/08/2009 17:55:22 GPUGRID Started download of 129-GIANNI_DOPa-9-par_file
10/08/2009 17:55:26 GPUGRID Finished download of 129-GIANNI_DOPa-9-par_file
10/08/2009 17:55:26 GPUGRID Started download of 129-GIANNI_DOPa-9-129
10/08/2009 17:55:28 GPUGRID Finished download of 129-GIANNI_DOPa-9-129
10/08/2009 17:55:28 GPUGRID Started download of 62-KASHIF_HIVPR_twomons_far_ba7-18-LICENSE
10/08/2009 17:55:29 GPUGRID Finished download of 62-KASHIF_HIVPR_twomons_far_ba7-18-LICENSE
10/08/2009 17:55:29 GPUGRID Started download of 62-KASHIF_HIVPR_twomons_far_ba7-18-COPYRIGHT
10/08/2009 17:55:31 GPUGRID Finished download of 62-KASHIF_HIVPR_twomons_far_ba7-18-COPYRIGHT
10/08/2009 17:55:31 GPUGRID Started download of 62-KASHIF_HIVPR_twomons_far_ba7-18-62-KASHIF_HIVPR_twomons_far_ba7-17-100-RND4467_1
10/08/2009 17:55:43 GPUGRID Finished download of 62-KASHIF_HIVPR_twomons_far_ba7-18-62-KASHIF_HIVPR_twomons_far_ba7-17-100-RND4467_1
10/08/2009 17:55:43 GPUGRID Started download of 62-KASHIF_HIVPR_twomons_far_ba7-18-62-KASHIF_HIVPR_twomons_far_ba7-17-100-RND4467_2
10/08/2009 17:55:55 GPUGRID Finished download of 62-KASHIF_HIVPR_twomons_far_ba7-18-62-KASHIF_HIVPR_twomons_far_ba7-17-100-RND4467_2
10/08/2009 17:55:55 GPUGRID Started download of 62-KASHIF_HIVPR_twomons_far_ba7-18-62-KASHIF_HIVPR_twomons_far_ba7-17-100-RND4467_3
10/08/2009 17:55:59 GPUGRID Sending scheduler request: To report completed tasks.
10/08/2009 17:55:59 GPUGRID Reporting 1 completed tasks, not requesting new tasks
10/08/2009 17:56:00 GPUGRID Finished download of 62-KASHIF_HIVPR_twomons_far_ba7-18-62-KASHIF_HIVPR_twomons_far_ba7-17-100-RND4467_3
10/08/2009 17:56:00 GPUGRID Started download of 62-KASHIF_HIVPR_twomons_far_ba7-18-pdb_file
10/08/2009 17:56:05 GPUGRID Scheduler request completed: got 0 new tasks
10/08/2009 17:56:09 GPUGRID Finished download of 129-GIANNI_DOPa-9-psf_file
10/08/2009 17:56:09 GPUGRID Started download of 62-KASHIF_HIVPR_twomons_far_ba7-18-psf_file
10/08/2009 17:56:10 GPUGRID Finished download of 62-KASHIF_HIVPR_twomons_far_ba7-18-psf_file
10/08/2009 17:56:10 GPUGRID Started download of 62-KASHIF_HIVPR_twomons_far_ba7-18-par_file
10/08/2009 17:56:12 GPUGRID Starting 129-GIANNI_DOPa-9-25-RND1401_1
10/08/2009 17:56:12 GPUGRID Starting task 129-GIANNI_DOPa-9-25-RND1401_1 using acemd version 667
10/08/2009 17:56:16 Milkyway@home Sending scheduler request: To fetch work.
10/08/2009 17:56:16 Milkyway@home Requesting new tasks
10/08/2009 17:56:17 GPUGRID Computation for task 129-GIANNI_DOPa-9-25-RND1401_1 finished
10/08/2009 17:56:17 GPUGRID Output file 129-GIANNI_DOPa-9-25-RND1401_1_1 for task 129-GIANNI_DOPa-9-25-RND1401_1 absent
10/08/2009 17:56:17 GPUGRID Output file 129-GIANNI_DOPa-9-25-RND1401_1_2 for task 129-GIANNI_DOPa-9-25-RND1401_1 absent
10/08/2009 17:56:17 GPUGRID Output file 129-GIANNI_DOPa-9-25-RND1401_1_3 for task 129-GIANNI_DOPa-9-25-RND1401_1 absent
10/08/2009 17:56:17 SETI@home Restarting task 22dc08ad.4376.103450.8.10.247_0 using setiathome_enhanced version 608
10/08/2009 17:56:19 GPUGRID Started upload of 129-GIANNI_DOPa-9-25-RND1401_1_0
10/08/2009 17:56:21 Milkyway@home Scheduler request completed: got 0 new tasks
10/08/2009 17:56:22 GPUGRID Finished upload of 129-GIANNI_DOPa-9-25-RND1401_1_0
10/08/2009 17:56:52 GPUGRID Finished download of 62-KASHIF_HIVPR_twomons_far_ba7-18-pdb_file
10/08/2009 17:56:52 GPUGRID Started download of 62-KASHIF_HIVPR_twomons_far_ba7-18-62
10/08/2009 17:56:53 GPUGRID Finished download of 62-KASHIF_HIVPR_twomons_far_ba7-18-62
10/08/2009 17:56:53 GPUGRID Started download of 59-GIANNI_BINDX119-37-LICENSE
10/08/2009 17:56:54 GPUGRID Finished download of 59-GIANNI_BINDX119-37-LICENSE
10/08/2009 17:56:54 GPUGRID Started download of 59-GIANNI_BINDX119-37-COPYRIGHT
10/08/2009 17:56:55 GPUGRID Finished download of 59-GIANNI_BINDX119-37-COPYRIGHT
10/08/2009 17:56:55 GPUGRID Started download of 59-GIANNI_BINDX119-37-59-GIANNI_BINDX119-36-100-RND8540_1
10/08/2009 17:57:03 GPUGRID Finished download of 59-GIANNI_BINDX119-37-59-GIANNI_BINDX119-36-100-RND8540_1
10/08/2009 17:57:03 GPUGRID Started download of 59-GIANNI_BINDX119-37-59-GIANNI_BINDX119-36-100-RND8540_2
10/08/2009 17:57:13 GPUGRID Finished download of 59-GIANNI_BINDX119-37-59-GIANNI_BINDX119-36-100-RND8540_2
10/08/2009 17:57:13 GPUGRID Started download of 59-GIANNI_BINDX119-37-59-GIANNI_BINDX119-36-100-RND8540_3
10/08/2009 17:57:20 GPUGRID Finished download of 59-GIANNI_BINDX119-37-59-GIANNI_BINDX119-36-100-RND8540_3
10/08/2009 17:57:20 GPUGRID Started download of 59-GIANNI_BINDX119-37-pdb_file
10/08/2009 17:57:20 GPUGRID Sending scheduler request: To report completed tasks.
10/08/2009 17:57:20 GPUGRID Reporting 1 completed tasks, not requesting new tasks
10/08/2009 17:57:26 GPUGRID Scheduler request completed: got 0 new tasks
10/08/2009 17:57:31 Milkyway@home Sending scheduler request: To fetch work.
10/08/2009 17:57:31 Milkyway@home Requesting new tasks
10/08/2009 17:57:37 Milkyway@home Scheduler request completed: got 1 new tasks
10/08/2009 17:57:37 GPUGRID Finished download of 62-KASHIF_HIVPR_twomons_far_ba7-18-par_file
10/08/2009 17:57:37 GPUGRID Started download of 59-GIANNI_BINDX119-37-psf_file
10/08/2009 17:57:38 GPUGRID Finished download of 59-GIANNI_BINDX119-37-psf_file
10/08/2009 17:57:38 GPUGRID Started download of 59-GIANNI_BINDX119-37-par_file
10/08/2009 17:57:38 Milkyway@home Started download of gs_82test_2Stream_1_search_parameters_2300072_1249923447
10/08/2009 17:57:39 Milkyway@home Finished download of gs_82test_2Stream_1_search_parameters_2300072_1249923447
10/08/2009 17:57:39 GPUGRID Starting 62-KASHIF_HIVPR_twomons_far_ba7-18-100-RND4467_0
10/08/2009 17:57:39 GPUGRID Starting task 62-KASHIF_HIVPR_twomons_far_ba7-18-100-RND4467_0 using acemd version 667
10/08/2009 17:57:44 GPUGRID Computation for task 62-KASHIF_HIVPR_twomons_far_ba7-18-100-RND4467_0 finished
10/08/2009 17:57:44 GPUGRID Output file 62-KASHIF_HIVPR_twomons_far_ba7-18-100-RND4467_0_1 for task 62-KASHIF_HIVPR_twomons_far_ba7-18-100-RND4467_0 absent
10/08/2009 17:57:44 GPUGRID Output file 62-KASHIF_HIVPR_twomons_far_ba7-18-100-RND4467_0_2 for task 62-KASHIF_HIVPR_twomons_far_ba7-18-100-RND4467_0 absent
10/08/2009 17:57:44 GPUGRID Output file 62-KASHIF_HIVPR_twomons_far_ba7-18-100-RND4467_0_3 for task 62-KASHIF_HIVPR_twomons_far_ba7-18-100-RND4467_0 absent
10/08/2009 17:57:44 SETI@home Restarting task 22dc08ad.4376.103450.8.10.247_0 using setiathome_enhanced version 608
10/08/2009 17:57:47 GPUGRID Started upload of 62-KASHIF_HIVPR_twomons_far_ba7-18-100-RND4467_0_0
10/08/2009 17:57:50 GPUGRID Finished upload of 62-KASHIF_HIVPR_twomons_far_ba7-18-100-RND4467_0_0
10/08/2009 17:57:54 GPUGRID Finished download of 59-GIANNI_BINDX119-37-pdb_file
10/08/2009 17:57:54 GPUGRID Started download of 59-GIANNI_BINDX119-37-59
|
|
|
GDFVolunteer moderator Project administrator Project developer Project tester Volunteer developer Volunteer tester Project scientist Send message
Joined: 14 Mar 07 Posts: 1957 Credit: 629,356 RAC: 0 Level
Scientific publications
|
try to restart the machine.
Do they still fail?
gdf |
|
|
|
try to restart the machine.
Do they still fail?
gdf
He's running driver 182.50, according to http://www.gpugrid.net/hosts_user.php?userid=21516.
Clue? |
|
|
|
He's running driver 182.50, according to http://www.gpugrid.net/hosts_user.php?userid=21516.
Clue?
Now that's a clue! The new on the homepage from July 25th says "The driver required will be at least 185.xx, that is required for CUDA2.2. If 190.xx works for you, you can also install it.", but is not visible any more. Maybe the current "July 31, 2009 | We have updated the Windows application (version 6.67) to cuda2.2" should state a bit more explicitely that driver >= 185 is required now?
MrS
____________
Scanning for our furry friends since Jan 2002 |
|
|
Rob.BSend message
Joined: 18 Apr 09 Posts: 3 Credit: 2,542,440 RAC: 0 Level
Scientific publications
|
Thanks for the pointers.
I had tried a machine restart without any joy. I did not know (or missed) the info about the drivers for CUDA. I will update those this evening when I return from work. Strange though that a driver error should log errors about missing output files.
Rob. |
|
|
|
Strange though that a driver error should log errors about missing output files.
That's not the error message (which can be found if you click the individual tasks under "your account" on the homepage). The "output file absent" just tells you that BOINC didn't find a proper result file to send back to the server. Which is quite logical, if there was an error and no actual computation happened.
MrS
____________
Scanning for our furry friends since Jan 2002 |
|
|
uBronan Send message
Joined: 1 Feb 09 Posts: 139 Credit: 575,023 RAC: 0 Level
Scientific publications
|
I started again to run gpugrid and the 1st i receive :
Name A476-TONI_HERG2-2-40-RND2944_2
Workunit 701833
Created 21 Aug 2009 3:40:24 UTC
Sent 21 Aug 2009 3:57:08 UTC
Received 21 Aug 2009 12:34:42 UTC
Server state Over
Outcome Client error
Client state Compute error
Exit status 1 (0x1)
Computer ID 47815
Report deadline 26 Aug 2009 3:57:08 UTC
Run time 171.7344
stderr out <core_client_version>6.6.36</core_client_version>
<![CDATA[
<message>
Incorrect function. (0x1) - exit code 1 (0x1)
</message>
<stderr_txt>
# Using CUDA device 0
# Device 0: "GeForce 9600 GT"
# Clock rate: 1.65 GHz
# Total amount of global memory: 536543232 bytes
# Number of multiprocessors: 8
# Number of cores: 64
MDIO ERROR: cannot open file "restart.coor"
# Using CUDA device 0
# Device 0: "GeForce 9600 GT"
# Clock rate: 1.65 GHz
# Total amount of global memory: 536543232 bytes
# Number of multiprocessors: 8
# Number of cores: 64
Cuda error: Kernel [pme_fill_charges_accumulate] failed in file 'fillcharges.cu' in line 73 : unspecified launch failure.
</stderr_txt>
]]>
Validate state Invalid
Claimed credit 4557.74189814815
Granted credit 0
application version 6.67
Driver version 190.13 the problem came after i got a update for windows and the machine wanted a reboot |
|
|
GDFVolunteer moderator Project administrator Project developer Project tester Volunteer developer Volunteer tester Project scientist Send message
Joined: 14 Mar 07 Posts: 1957 Credit: 629,356 RAC: 0 Level
Scientific publications
|
Update the driver to 190.38 and you will receive the CUDA2.3 application.
gdf |
|
|
K1atOdessaSend message
Joined: 25 Feb 08 Posts: 249 Credit: 387,028,788 RAC: 1,197,795 Level
Scientific publications
|
Update the driver to 190.38 and you will receive the CUDA2.3 application.
gdf
The CUDA2.3 app will potentially fix this issue? I've had most or all of the TONI_HERG WU's error out and a lot of the GIANNA_DOP ones. I just upgraded to 6.6.36 and have 190.38 drivers. I'd love it if the CUDA2.3 app solved that problem. I'll keep an eye on it.
GIANNI_DOP and TONI_HERG errors |
|
|
uBronan Send message
Joined: 1 Feb 09 Posts: 139 Credit: 575,023 RAC: 0 Level
Scientific publications
|
Sorry made mistake i have 190.38 drivers installed and every unit fails after some hours of processing untill now all fail with the same error |
|
|
Beyond Send message
Joined: 23 Nov 08 Posts: 1112 Credit: 6,162,416,256 RAC: 0 Level
Scientific publications
|
Think this WU might be bad?
http://www.gpugrid.net/workunit.php?wuid=720687
|
|
|
K1atOdessaSend message
Joined: 25 Feb 08 Posts: 249 Credit: 387,028,788 RAC: 1,197,795 Level
Scientific publications
|
Think this WU might be bad?
http://www.gpugrid.net/workunit.php?wuid=720687
And another: Bad WU
I've had issues on specific types of WU's that have issues on non-200 series cards. Trouble WU's Quite a few of these WU's had issues on other computers, not just mine. And in most cases the issues were centered around "slower" non-200 series cards, but subsequently finished by a 200-series.
I've let it go for a while, but I might have to be proactive and manually cancel the bad types for my system. |
|
|
Beyond Send message
Joined: 23 Nov 08 Posts: 1112 Credit: 6,162,416,256 RAC: 0 Level
Scientific publications
|
Think this WU might be bad?
http://www.gpugrid.net/workunit.php?wuid=720687
I've had issues on specific types of WU's that have issues on non-200 series cards. Trouble WU's Quite a few of these WU's had issues on other computers, not just mine. And in most cases the issues were centered around "slower" non-200 series cards, but subsequently finished by a 200-series.
Not so with the one listed, it's failed on every card including 4 of the 200-series.
|
|
|
K1atOdessaSend message
Joined: 25 Feb 08 Posts: 249 Credit: 387,028,788 RAC: 1,197,795 Level
Scientific publications
|
Think this WU might be bad?
http://www.gpugrid.net/workunit.php?wuid=720687
I've had issues on specific types of WU's that have issues on non-200 series cards. Trouble WU's Quite a few of these WU's had issues on other computers, not just mine. And in most cases the issues were centered around "slower" non-200 series cards, but subsequently finished by a 200-series.
Not so with the one listed, it's failed on every card including 4 of the 200-series.
Yeah. That WU you got seems to be really nasty. It's a GIANNI_BIND type that I've occasionally had issues with as well.
But not as much as the GIANNI_DOP AND "HERG" ones. I've seen some WU's that have failed on mine (and other non-200 series) and finished fine on the 200-series. I can't guess as to why, but it just seems unusual to see a pattern like this across different machines. Maybe nothing to it, but it would be nice to know if I just can't expect certain WU's to crunch to completion. I could abort them early to let a 200-series finish them sooner (or ideally not receive those type at all).
Example 1
Example 2
Example 3 |
|
|
GDFVolunteer moderator Project administrator Project developer Project tester Volunteer developer Volunteer tester Project scientist Send message
Joined: 14 Mar 07 Posts: 1957 Credit: 629,356 RAC: 0 Level
Scientific publications
|
Maybe overclocking.
gdf
|
|
|
K1atOdessaSend message
Joined: 25 Feb 08 Posts: 249 Credit: 387,028,788 RAC: 1,197,795 Level
Scientific publications
|
Maybe overclocking.
gdf
Could be. Maybe I'll try lowering it to stock and run it for a while. Had them OC'd the same way for 8 months. Are those specific WU types more susceptible to OC issues? |
|
|