Message boards : Number crunching : Missing ready WU's in my Status Listing

Author Message
Killersocke
Joined: 18 Oct 13
Posts: 53
Credit: 406,647,419
RAC: 0
Level
Gln
Message 44373 - Posted: 1 Sep 2016 | 11:54:59 UTC

Greetings,

Since the end of August I have finished 2 WUs.
Task 15260175
WU 11704740
e1s3_0-GERARD_FXCXCL12RX_649283_1-0-1-RND5813_1
Sent 29 Aug 2016 | 18:29:33 UTC

Task 15264666
WU 11708582
e24s252_e4s64p0f396-GIANNI_D3C36bCHL1-0-1-RND5240_2
Sent 1 Sep 2016 | 10:55:40 UTC

But I see no record of them being received in my results, and
nothing reported about these WUs in my local log file.

Are these Ghosts at work?
Any Ghostbusters there to solve the problems?

Regards

-------job_log--->
1471135113 ue 31859.200956 ct 27494.250000 fe 5000000000000000 nm e36s8_e32s2p0f50-GERARD_FXCXCL12RX_1167531_1-0-1-RND7553_0 et 43353.775185 es 0
1471338458 ue 31859.200956 ct 56975.970000 fe 5000000000000000 nm e3s4_e1s26p0f688-GIANNI_D3C36bCHL1-0-1-RND1485_0 et 102991.802909 es 0
1471449864 ue 31859.200956 ct 29383.090000 fe 5000000000000000 nm e38s20_e3s4p0f341-GERARD_FXCXCL12RX_629461_2-0-1-RND1780_0 et 43372.013410 es 0
1471536750 ue 31859.200956 ct 27106.750000 fe 5000000000000000 nm e24s54_e5s22p0f221-GERARD_MO_MOR_2-0-1-RND5829_3 et 49477.585356 es 0
1471604370 ue 31859.200956 ct 20369.230000 fe 5000000000000000 nm e24s349_e2s20p0f408-ADRIA_2OV5_CONF_ASP1-0-1-RND2231_0 et 38419.386838 es 0
1472027764 ue 32296.546681 ct 29374.830000 fe 5000000000000000 nm e1s32_3-GERARD_FXCXCL12RX_1041734_1-0-1-RND5428_0 et 43980.238071 es 0
1472082759 ue 32296.546681 ct 22223.660000 fe 5000000000000000 nm e2s11_e1s6p0f332-ADRIA_2OV5_CONF_CLOSED1-0-1-RND9193_0 et 40345.447290 es 0
1472158344 ue 32296.546681 ct 29151.190000 fe 5000000000000000 nm e1s19_3-GERARD_FXCXCL12RX_125902_2-0-1-RND0689_0 et 43271.278313 es 0
1472290267 ue 32296.546681 ct 22381.130000 fe 5000000000000000 nm e4s34_e1s36p0f460-GIANNI_A2A-0-1-RND5051_0 et 48776.628363 es 0
1472392666 ue 32296.546681 ct 25008.990000 fe 5000000000000000 nm e1s1_0-GERARD_FXCXCL12RX_617412_2-0-1-RND8685_0 et 41137.014603 es 0
1472478914 ue 32296.546681 ct 26564.690000 fe 5000000000000000 nm e1s6_1-GERARD_FXCXCL12RX_629809_1-0-1-RND7698_0 et 41299.511654 es 0
1472587585 ue 32296.546681 ct 27135.830000 fe 5000000000000000 nm e1s5_0-GERARD_FXCXCL12RX_680569_2-0-1-RND5067_0 et 43088.872862 es 0
1472728701 ue 32296.546681 ct 24127.430000 fe 5000000000000000 nm e1s7_1-GERARD_FXCXCL12RX_783477_2-0-1-RND4621_0 et 40102.659794 es 0
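
A minimal sketch for checking whether a given task name appears in job_log lines like the ones above. It assumes each line is a Unix timestamp followed by space-separated key/value pairs; my reading of the keys (ue = estimated runtime, ct = CPU time, fe = FLOPs estimate, nm = task name, et = elapsed time, es = exit status) is an assumption and not taken from this thread. The function name `parse_job_log_line` is hypothetical, and the sample is just the last two lines pasted above.

```python
from datetime import datetime, timezone

def parse_job_log_line(line):
    """Split one job_log line into a dict (hypothetical helper)."""
    tokens = line.split()
    # First token: Unix timestamp, interpreted here as UTC.
    record = {"finished": datetime.fromtimestamp(int(tokens[0]), tz=timezone.utc)}
    # Remaining tokens alternate: key, value, key, value, ...
    for key, value in zip(tokens[1::2], tokens[2::2]):
        record[key] = value
    return record

# Last two lines of the log pasted above.
SAMPLE = """\
1472587585 ue 32296.546681 ct 27135.830000 fe 5000000000000000 nm e1s5_0-GERARD_FXCXCL12RX_680569_2-0-1-RND5067_0 et 43088.872862 es 0
1472728701 ue 32296.546681 ct 24127.430000 fe 5000000000000000 nm e1s7_1-GERARD_FXCXCL12RX_783477_2-0-1-RND4621_0 et 40102.659794 es 0
"""

records = [parse_job_log_line(l) for l in SAMPLE.splitlines() if l.strip()]
logged_names = {r["nm"] for r in records}

# One of the two tasks the server lists as sent.
missing = "e1s3_0-GERARD_FXCXCL12RX_649283_1-0-1-RND5813_1"
found = missing in logged_names
print(found)
```

Run against the full job_log file, a task that is listed on the server but never shows up in `logged_names` is a candidate ghost.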

Richard Haselgrove
Joined: 11 Jul 09
Posts: 1620
Credit: 8,832,166,430
RAC: 19,849,425
Level
Tyr
Message 44394 - Posted: 2 Sep 2016 | 9:55:08 UTC - in response to Message 44373.
Last modified: 2 Sep 2016 | 9:56:16 UTC

Are these Ghosts at work?

There was certainly one at work last night:

01-Sep-2016 21:01:56 [GPUGRID] [sched_op] Starting scheduler request
01-Sep-2016 21:01:56 [GPUGRID] Sending scheduler request: To fetch work.
01-Sep-2016 21:01:56 [GPUGRID] Requesting new tasks for NVIDIA GPU
01-Sep-2016 21:01:56 [GPUGRID] [sched_op] CPU work request: 0.00 seconds; 0.00 devices
01-Sep-2016 21:01:56 [GPUGRID] [sched_op] NVIDIA GPU work request: 32.31 seconds; 0.00 devices
01-Sep-2016 21:01:56 [GPUGRID] [sched_op] Intel GPU work request: 0.00 seconds; 0.00 devices
01-Sep-2016 21:03:02 [GPUGRID] Scheduler request failed: Timeout was reached
01-Sep-2016 21:03:02 [GPUGRID] Sending scheduler request: To fetch work.
01-Sep-2016 21:03:02 [GPUGRID] Requesting new tasks for NVIDIA GPU
01-Sep-2016 21:03:02 [GPUGRID] [sched_op] CPU work request: 0.00 seconds; 0.00 devices
01-Sep-2016 21:03:02 [GPUGRID] [sched_op] NVIDIA GPU work request: 32.31 seconds; 0.00 devices
01-Sep-2016 21:03:02 [GPUGRID] [sched_op] Intel GPU work request: 0.00 seconds; 0.00 devices
01-Sep-2016 21:03:07 [GPUGRID] Scheduler request completed: got 1 new tasks
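
A rough sketch of how one might pick failed scheduler requests out of an event log like the excerpt above. It assumes every line has the form `DD-Mon-YYYY HH:MM:SS [project] message` and that a failure belongs to the most recent "Sending scheduler request" line; `parse_event` and `failed_requests` are hypothetical names, not part of any BOINC tool.

```python
import re
from datetime import datetime

EVENT = re.compile(r"^(\d{2}-\w{3}-\d{4} \d{2}:\d{2}:\d{2}) \[([^\]]+)\] (.*)$")

def parse_event(line):
    """Return (timestamp, project, message), or None if the line does not match."""
    m = EVENT.match(line.strip())
    if m is None:
        return None
    stamp, project, message = m.groups()
    # %b expects English month abbreviations ("Sep") under the default locale.
    return datetime.strptime(stamp, "%d-%b-%Y %H:%M:%S"), project, message

def failed_requests(lines):
    """List (sent_at, failed_at, reason) for each scheduler request that failed."""
    failures, sent_at = [], None
    for line in lines:
        event = parse_event(line)
        if event is None:
            continue
        when, _project, message = event
        if message.startswith("Sending scheduler request"):
            sent_at = when
        elif message.startswith("Scheduler request failed:"):
            reason = message.split(":", 1)[1].strip()
            failures.append((sent_at, when, reason))
    return failures

LOG = """\
01-Sep-2016 21:01:56 [GPUGRID] Sending scheduler request: To fetch work.
01-Sep-2016 21:03:02 [GPUGRID] Scheduler request failed: Timeout was reached
01-Sep-2016 21:03:02 [GPUGRID] Sending scheduler request: To fetch work.
01-Sep-2016 21:03:07 [GPUGRID] Scheduler request completed: got 1 new tasks
"""

failures = failed_requests(LOG.splitlines())
waited = (failures[0][1] - failures[0][0]).total_seconds()
print(len(failures), waited, failures[0][2])
```

A request that times out on the client may still have been processed by the server, which is one way a task can be allocated without ever reaching the host.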

Host 45218 was allocated two tasks close to that time:

15265184 11712617 1 Sep 2016 | 19:58:31 UTC 6 Sep 2016 | 19:58:31 UTC In progress --- --- --- Long runs (8-12 hours on fastest card) v8.48 (cuda65)
15265078 11712516 1 Sep 2016 | 19:58:29 UTC 6 Sep 2016 | 19:58:29 UTC In progress --- --- --- Long runs (8-12 hours on fastest card) v8.48 (cuda65)

Any Ghostbusters there to solve the problems?

I am worried that the project server clock seems to be some 4.5 minutes behind mine - my logs are UTC+1, which accounts for the larger difference, but there's still an anomaly. I'm posting from the machine in question, and I can see that the local clock is synchronised within about 10 seconds of universal time.
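
To make the arithmetic explicit, here is a small sketch of the comparison. It assumes the "got 1 new tasks" line in my log (21:03:07, UTC+1) corresponds to the later of the two server "sent" times (19:58:31 UTC); that pairing is a guess, and network delay is ignored.

```python
from datetime import datetime, timedelta, timezone

local_tz = timezone(timedelta(hours=1))  # the client log is UTC+1

# Client side: scheduler request completed (from the log excerpt).
client_done = datetime(2016, 9, 1, 21, 3, 7, tzinfo=local_tz)
# Server side: "sent" time of task 15265184 (from the host's task list).
server_sent = datetime(2016, 9, 1, 19, 58, 31, tzinfo=timezone.utc)

# Subtracting timezone-aware datetimes removes the one-hour offset automatically.
skew_seconds = (client_done - server_sent).total_seconds()
print(skew_seconds, skew_seconds / 60)
```

That leaves 276 seconds, or about 4.6 minutes, once the timezone difference is taken out.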

Hooking the server up to an NTP time source might be a wise move and reduce some of the issues the project is facing.

Edit - I posted that at 10:59, and made this edit a few seconds after 11:00, UTC+1.

Killersocke
Joined: 18 Oct 13
Posts: 53
Credit: 406,647,419
RAC: 0
Level
Gln
Message 44409 - Posted: 2 Sep 2016 | 21:31:02 UTC

Many thanks for your information.
I am happy that it was not a problem on my side :-)
