Message boards : Number crunching : Why am I getting TOO MANY WU's?
Author | Message |
---|---|
I can't seem to get GPUGRID to follow my BOINC local preferences. I've got the Minimum Work Buffer set to 0.02 days and the Max Additional Work Buffer set to 0.15 days, and I still get seven WUs for my 4-GPU-core machines, even when there are 5-9 hours left on the 4 WUs that are running. The resource share is only 62.14%, and my app_config is set to run no more than 4 WUs at a time. Am I missing something? Hope someone can help. Thanks in advance, Rick | |
ID: 32828 | Rating: 0 | rate: / Reply Quote | |
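For reference, the buffer settings Rick describes can be written as a local override file. This is only a minimal sketch, assuming the standard global_prefs_override.xml in the BOINC data directory; the two values are the ones from the post above, and the hour conversions in the comments are plain arithmetic (days x 24).

```xml
<!-- global_prefs_override.xml: sketch only; values taken from the post above -->
<global_preferences>
  <!-- Minimum work buffer: 0.02 days x 24 = 0.48 hours -->
  <work_buf_min_days>0.02</work_buf_min_days>
  <!-- Max additional work buffer: 0.15 days x 24 = 3.6 hours -->
  <work_buf_additional_days>0.15</work_buf_additional_days>
</global_preferences>
```

Together that is 0.17 days, or roughly 4.1 hours of buffered work, which is less than the 5-9 hours reported as remaining on each running task. BOINC Manager normally writes this file itself when local preferences are set through its dialog, so editing it by hand should rarely be needed.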
The estimated remaining runtime could well be incorrect (due to the recent short and long test work units all being described as 5M Gflops server side). | |
ID: 32830 | Rating: 0 | rate: / Reply Quote | |
I thought Intel GPUs are listed as a resource type that is different from NVIDIA, and so they are naturally not included in any work fetch requests to GPUGrid. | |
ID: 32832 | Rating: 0 | rate: / Reply Quote | |
Maybe development versions of BOINC handle the Intel GPUs differently, but I have an exclude_gpu setting for my intel_gpu for GPUGrid in my cc_config file:
<cc_config>
  <options>
    <exclude_gpu>
      <url>http://www.gpugrid.net/</url>
      <type>intel_gpu</type>
    </exclude_gpu>
  </options>
</cc_config>
| |
ID: 32835 | Rating: 0 | rate: / Reply Quote | |
I'd be curious what happens when you take that out, and restart BOINC. If it doesn't behave correctly, maybe we can look at it further to see if a bug existed/exists. | |
ID: 32836 | Rating: 0 | rate: / Reply Quote | |
"The estimated remaining runtime could well be incorrect (due to the recent short and long test work units all being described as 5M Gflops server side)." Thanks for the feedback, Jacob & skgiven. BOINC is only showing recognition of my 4 NVIDIA processors, NOT the Intel (btw, I didn't even know my computer had an Intel GPU. Is there a way to use that one too?). I do not have any network restrictions. My wireless network is connected full time to a cable modem, with backup to a second wireless network connected to my backup DSL modem. I'll try to answer any additional questions posted. Thanks, Rick | |
ID: 32837 | Rating: 0 | rate: / Reply Quote | |
Just after deleting my exclude intel_gpu, and immediately got, | |
ID: 32838 | Rating: 0 | rate: / Reply Quote | |
Right, skgiven: you have an "intel_gpu" resource, and GPUGrid's scheduler reply (sched_reply_www.gpugrid.net.xml) doesn't explicitly say that the project doesn't support it, so BOINC asks for work for it. | |
ID: 32839 | Rating: 0 | rate: / Reply Quote | |
"Just after deleting my exclude intel_gpu, and immediately got," Jacob, where can I get the Intel drivers? I'd love to get more out of my machines. Thanks in advance, Rick | |
ID: 32841 | Rating: 0 | rate: / Reply Quote | |
I think skgiven would have to answer that one. | |
ID: 32842 | Rating: 0 | rate: / Reply Quote | |
Makes sense. | |
ID: 32843 | Rating: 0 | rate: / Reply Quote | |
Rick, see the following thread in Einstein, | |
ID: 32844 | Rating: 0 | rate: / Reply Quote | |
"Rick, see the following thread in Einstein," Thanks for pointing me in the right direction, skgiven. Unfortunately, I guess you can't run the Intel HD processor if you already have two dual-processor video cards in your box. It cost me $60 and an 80-mile trip to the computer store to find that out. But thanks for trying to help. Regards, Rick | |
ID: 32863 | Rating: 0 | rate: / Reply Quote | |
"Unfortunately, I guess you can't run the Intel HD processor if you already have two dual-processor video cards in your box. It cost me $60 and an 80-mile trip to the computer store to find that out. But thanks for trying to help. Regards, Rick" Don't see why that would be the case. Did they mention why, or just charge you $60 and say it can't be done? Have you tried using the IHD4600 to drive your primary monitor? While I don't have two dual GPUs in any system, I do have 3 discrete GPUs in one system, and I can use my iHD4000 GPU to run Einstein. | |
ID: 32870 | Rating: 0 | rate: / Reply Quote | |
The techs just told me they could only get two of the three to work at one time (I think they spent most of their time undoing the mess I made trying to do it myself). I only have an HDMI output for the IHD4600, and don't know how to connect it to my KVM switch. I used to be knowledgeable about these things in the 1960s & 70s, when you had to rewire computers to get them to complete a task. Now technology is moving faster than this old man can keep up with, but I'll keep on trying. Thanks for the nuggets of info. I'll keep following the trail until I figure this out. Regards, Rick | |
ID: 32871 | Rating: 0 | rate: / Reply Quote | |
Skgiven, I can't thank you enough for the help getting my IHD to work. Your suggestion to use the IHD for my primary monitor did the trick, and I'm now crunching Einstein@home on it! Now more NVIDIA GPU time for your GPUGRID. Now on to my other two i7 machines, to try to get them to work too. Thanks again, Rick | |
ID: 32878 | Rating: 0 | rate: / Reply Quote | |
I believe you need a load on the Intel VGA (or DVI) port. One way is to plug a monitor in. Another way would be to use a dummy VGA plug. Most BIOSes are set up so that if they don't detect a device on the iGPU and you have another graphics card, they simply disable the iGPU. | |
ID: 32881 | Rating: 0 | rate: / Reply Quote | |
"I believe you need a load on the Intel VGA (or DVI) port. One way is to plug a monitor in. Another way would be to use a dummy VGA plug. Most BIOSes are set up so that if they don't detect a device on the iGPU and you have another graphics card, they simply disable the iGPU." Thanks for the input, MarkJ. Luckily, I installed the correct (full) driver directly from Intel's site. I was using the Intel port for my monitor (that was the only way I could get the Intel GPU to work), but maybe I need the dummy plug for drive 0 on my GTX690? I've ordered 3 VGA dummy plugs (1 for each of my i7 machines), and hope to receive them by next weekend. It would be great if that's all that's needed. Thanks again, Rick | |
ID: 32962 | Rating: 0 | rate: / Reply Quote | |
I have 4 machines running with no monitors and no dummy plugs; 3 of the machines have 2 NVIDIA cards each and 1 has 1 AMD card. They all work fine. I think only the integrated GPU (Intel) needs a dummy plug. | |
ID: 32984 | Rating: 0 | rate: / Reply Quote | |
I have my monitor on the iHD4000, and don't use a dummy plug for the 3 discrete GPU's in the same system (660Ti, 660, HD5850). I can still use the iHD4000 at Einstein and the 3 discrete cards to crunch. The only issue is that I cannot launch the NVidia control panel (but I don't really need to anyway - things are working well enough as is). | |
ID: 32991 | Rating: 0 | rate: / Reply Quote | |
Yes.. only Intel has not yet got their heads around the fact that if they want their GPUs to be used as coprocessors (do they?), they should consider the remote possibility that someone wants to OpenCL-crunch some numbers on them without displaying the result on an attached display. | |
ID: 33021 | Rating: 0 | rate: / Reply Quote | |
That sort of thing is usually down to the BIOS - some systems will insist on disabling the iGPU if there's a discrete card present, others let you choose which has priority. Still others let you choose but then bugger things up anyway. Matt | |
ID: 33027 | Rating: 0 | rate: / Reply Quote | |
That's not what I'm talking about. I'm starting from the point where all GPUs are crunching happily ever after.. until you remove the display from the iGPU. At this point, or when the next WU is supposed to start, BOINC can't detect the card as an OpenCL device any more because Intel's driver sent it to bed. The BIOS has nothing to say in this if you're already in Win and both can run simultaneously in principle. | |
ID: 33074 | Rating: 0 | rate: / Reply Quote | |
I am happy to report successful use of two iGPUs (both 4770s). For me, it required the BIOS setting for the iGPU to be set to "Always On", my monitor attached to my GTX690s, AND a VGA dummy plug attached to the iGPU. They have been running for 5 hrs now without a problem. So far only SETI Beta WUs have run, but I hope to get Einstein WUs soon. I'll report when I get some successfully run. Getting my 3770 iGPU working using the same techniques was a dismal failure. Will keep on trying though. BTW, I'm still having the problem of getting too many GPUGRID WUs at a time (usually 6 WUs for my 4 cores). Hope someone can help me with that too. Thanks to all for the help. Regards, Rick | |
ID: 33151 | Rating: 0 | rate: / Reply Quote | |
I think, by default, you'll get "up to 2 GPUGrid tasks per GPU".
<cc_config>
  <log_flags>
    <!-- The 3 flags that are on by default are: file_xfer, sched_ops, task -->
    <file_xfer>1</file_xfer>
    <file_xfer_debug>0</file_xfer_debug>
    <sched_ops>1</sched_ops>
    <sched_op_debug>0</sched_op_debug>
    <task>1</task>
    <task_debug>0</task_debug>
    <unparsed_xml>1</unparsed_xml>
    <work_fetch_debug>0</work_fetch_debug>
    <rr_simulation>0</rr_simulation>
    <rrsim_detail>0</rrsim_detail>
    <cpu_sched>0</cpu_sched>
    <cpu_sched_debug>0</cpu_sched_debug>
    <cpu_sched_status>0</cpu_sched_status>
    <coproc_debug>1</coproc_debug>
    <mem_usage_debug>0</mem_usage_debug>
    <checkpoint_debug>1</checkpoint_debug>
    <http_debug>0</http_debug>
    <http_xfer_debug>0</http_xfer_debug>
    <network_status_debug>0</network_status_debug>
    <scrsave_debug>1</scrsave_debug>
    <notice_debug>0</notice_debug>
    <app_msg_receive>0</app_msg_receive>
    <app_msg_send>0</app_msg_send>
    <async_file_debug>0</async_file_debug>
    <benchmark_debug>0</benchmark_debug>
    <dcf_debug>0</dcf_debug>
    <disk_usage_debug>0</disk_usage_debug>
    <priority_debug>0</priority_debug>
    <gui_rpc_debug>0</gui_rpc_debug>
    <heartbeat_debug>0</heartbeat_debug>
    <poll_debug>0</poll_debug>
    <proxy_debug>0</proxy_debug>
    <slot_debug>0</slot_debug>
    <state_debug>0</state_debug>
    <statefile_debug>0</statefile_debug>
    <suspend_debug>0</suspend_debug>
    <time_debug>0</time_debug>
    <trickle_debug>0</trickle_debug>
  </log_flags>
  <options>
    <!-- ===== TESTING OPTIONS ===== -->
    <!--
    <start_delay>20</start_delay>
    <ncpus>8</ncpus>
    <exclusive_app>NotepadTest01.exe</exclusive_app>
    <exclusive_gpu_app>NotepadTest02.exe</exclusive_gpu_app>
    -->

    <!-- ===== REGULAR OPTIONS ===== -->
    <report_results_immediately>0</report_results_immediately>
    <fetch_on_update>0</fetch_on_update>
    <max_event_log_lines>0</max_event_log_lines>
    <max_file_xfers>10</max_file_xfers>
    <max_file_xfers_per_project>4</max_file_xfers_per_project>
    <exclusive_app>iRacingSim.exe</exclusive_app>
    <exclusive_app>iRacingSim64.exe</exclusive_app>
    <exclusive_app>Aces.exe</exclusive_app>
    <exclusive_app>TmForever.exe</exclusive_app>
    <exclusive_app>TmForeverLauncher.exe</exclusive_app>

    <!-- ===== SETUP GPUS ===== -->
    <use_all_gpus>1</use_all_gpus>

    <!-- ===== SETUP GPU 0: GeForce GTX 660 Ti ===== -->
    <!-- <ignore_nvidia_dev>0</ignore_nvidia_dev> -->
    <!-- Exclude World Community Grid's "Help Conquer Cancer" GPU app (hcc1) on main display - makes graphics slow, even on 660 Ti -->
    <!-- Commenting out, for now, since this round of hcc1 is completed, and next round may not exhibit the issue. -->
    <!--
    <exclude_gpu>
      <url>http://www.worldcommunitygrid.org</url>
      <device_num>0</device_num>
      <app>hcc1</app>
    </exclude_gpu>
    -->
    <!-- Exclude Einstein/Albert, since work from other GPU projects should give enough work to keep this GPU busy. -->
    <exclude_gpu>
      <url>http://einstein.phys.uwm.edu/</url>
      <device_num>0</device_num>
    </exclude_gpu>
    <exclude_gpu>
      <url>http://albert.phys.uwm.edu/</url>
      <device_num>0</device_num>
    </exclude_gpu>
    <!-- Exclude SETI/Beta, since work from other GPU projects should give enough work to keep this GPU busy. -->
    <exclude_gpu>
      <url>http://setiathome.berkeley.edu/</url>
      <device_num>0</device_num>
    </exclude_gpu>
    <exclude_gpu>
      <url>http://setiweb.ssl.berkeley.edu/beta/</url>
      <device_num>0</device_num>
    </exclude_gpu>
    <!-- Exclude Milkyway@Home, since work from other GPU projects should give enough work to keep this GPU busy. -->
    <exclude_gpu>
      <url>http://milkyway.cs.rpi.edu/milkyway/</url>
      <device_num>0</device_num>
    </exclude_gpu>

    <!-- ===== SETUP GPU 1: GeForce GTX 460 ===== -->
    <!-- <ignore_nvidia_dev>1</ignore_nvidia_dev> -->
    <!-- Exclude POEM's "POEM++ OpenCL version" GPU app (poemcl) from a second heterogeneous GPU, since it does not work properly -->
    <!-- Note: Although 320.18 drivers successfully run smalltest_3, the drivers still do not work right with POEM. -->
    <!-- Note: Also, it appears that running POEM only on the GTX 460, does not work. So, it must run on the GTX 660 Ti! -->
    <exclude_gpu>
      <url>http://boinc.fzk.de/poem/</url>
      <device_num>1</device_num>
      <app>poemcl</app>
    </exclude_gpu>
    <!-- Reminder: For GPUGrid.net, if going to run 2-tasks-on-1-GPU, exclude this GPU (it only has 1 GB memory) -->
    <!--
    <exclude_gpu>
      <url>http://www.gpugrid.net</url>
      <device_num>1</device_num>
    </exclude_gpu>
    -->
    <!-- Exclude Einstein/Albert, since work from other GPU projects should give enough work to keep this GPU busy. -->
    <exclude_gpu>
      <url>http://einstein.phys.uwm.edu/</url>
      <device_num>1</device_num>
    </exclude_gpu>
    <exclude_gpu>
      <url>http://albert.phys.uwm.edu/</url>
      <device_num>1</device_num>
    </exclude_gpu>
    <!-- Exclude SETI/Beta, since work from other GPU projects should give enough work to keep this GPU busy. -->
    <exclude_gpu>
      <url>http://setiathome.berkeley.edu/</url>
      <device_num>1</device_num>
    </exclude_gpu>
    <exclude_gpu>
      <url>http://setiweb.ssl.berkeley.edu/beta/</url>
      <device_num>1</device_num>
    </exclude_gpu>
    <!-- Exclude Milkyway@Home, since work from other GPU projects should give enough work to keep this GPU busy. -->
    <exclude_gpu>
      <url>http://milkyway.cs.rpi.edu/milkyway/</url>
      <device_num>1</device_num>
    </exclude_gpu>

    <!-- ===== SETUP GPU 2: GeForce GTS 240 ===== -->
    <!-- <ignore_nvidia_dev>2</ignore_nvidia_dev> -->
    <!-- Exclude World Community Grid's Help Conquer Cancer GPU app -->
    <!-- GPU not supported per https://secure.worldcommunitygrid.org/help/viewTopic.do?shortName=GPU#610 -->
    <exclude_gpu>
      <url>http://www.worldcommunitygrid.org</url>
      <device_num>2</device_num>
      <app>hcc1</app>
    </exclude_gpu>
    <!-- Exclude POEM's "POEM++ OpenCL version" GPU app (poemcl) from a second heterogeneous GPU, since it does not work properly -->
    <!-- Also, GPU is not supported, as all tasks immediately error out -->
    <exclude_gpu>
      <url>http://boinc.fzk.de/poem/</url>
      <device_num>2</device_num>
      <app>poemcl</app>
    </exclude_gpu>
    <!-- Exclude GPUGrid.net -->
    <!-- GPU not supported per http://www.gpugrid.net/forum_thread.php?id=2507 -->
    <exclude_gpu>
      <url>http://www.gpugrid.net/</url>
      <device_num>2</device_num>
    </exclude_gpu>
    <!-- Exclude Milkyway@Home -->
    <!-- GPU not supported, as all tasks immediately error out -->
    <exclude_gpu>
      <url>http://milkyway.cs.rpi.edu/milkyway/</url>
      <device_num>2</device_num>
    </exclude_gpu>
  </options>
</cc_config>
If done successfully, you'll see the exclusions listed towards the beginning of your Event Log when you restart BOINC. For instance, mine says:
9/21/2013 9:13:51 PM | Einstein@Home | Config: excluded GPU. Type: all. App: all. Device: 0
9/21/2013 9:13:51 PM | Albert@Home | Config: excluded GPU. Type: all. App: all. Device: 0
9/21/2013 9:13:51 PM | SETI@home | Config: excluded GPU. Type: all. App: all. Device: 0
9/21/2013 9:13:51 PM | SETI@home Beta Test | Config: excluded GPU. Type: all. App: all. Device: 0
9/21/2013 9:13:51 PM | Milkyway@Home | Config: excluded GPU. Type: all. App: all. Device: 0
9/21/2013 9:13:51 PM | Poem@Home | Config: excluded GPU. Type: all. App: poemcl. Device: 1
9/21/2013 9:13:51 PM | Einstein@Home | Config: excluded GPU. Type: all. App: all. Device: 1
9/21/2013 9:13:51 PM | Albert@Home | Config: excluded GPU. Type: all. App: all. Device: 1
9/21/2013 9:13:51 PM | SETI@home | Config: excluded GPU. Type: all. App: all. Device: 1
9/21/2013 9:13:51 PM | SETI@home Beta Test | Config: excluded GPU. Type: all. App: all. Device: 1
9/21/2013 9:13:51 PM | Milkyway@Home | Config: excluded GPU. Type: all. App: all. Device: 1
9/21/2013 9:13:51 PM | World Community Grid | Config: excluded GPU. Type: all. App: hcc1. Device: 2
9/21/2013 9:13:51 PM | Poem@Home | Config: excluded GPU. Type: all. App: poemcl. Device: 2
9/21/2013 9:13:51 PM | GPUGRID | Config: excluded GPU. Type: all. App: all. Device: 2
9/21/2013 9:13:51 PM | Milkyway@Home | Config: excluded GPU. Type: all. App: all. Device: 2
Good luck! | |
ID: 33153 | Rating: 0 | rate: / Reply Quote | |
Thanks for sharing your setup, Jacob. I too use a cc_config to try to exclude my iGPU from the calculations BOINC makes to get GPUGRID work. My buffer settings are similar to yours: 0.02 minimum days, 0.23 additional days. I do use project-specific app_config.xml files to allow multiple projects to run simultaneously on my GTX690s. Other than those, I let BOINC get work for all my projects and allow BOINC access to all my computer capability. I still wish I'd only get 1 GPUGRID WU per GPU until I get down to the 0.02 minimum shown in my preferences, and that when it gets work, it would only get 1 WU (because 1 long WU is longer than the 0.25 days of work also shown in my preferences). Oh well, I'll keep trying, but I hate aborting GPUGRID WUs because they've been sitting in my queue unable to run because I got more than 4 WUs. Thanks again, Regards, Rick | |
ID: 33160 | Rating: 0 | rate: / Reply Quote | |
If you want me to do some more research into it, I can. What I'd need you to do is to turn on work_fetch_debug in the cc_config log_flags, then capture a segment where you believe BOINC fetched work erroneously. | |
ID: 33161 | Rating: 0 | rate: / Reply Quote | |
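A minimal sketch of the change Jacob is asking for, assuming an otherwise empty cc_config.xml. In a file that already has a log_flags section, only the work_fetch_debug line needs to go from 0 to 1; nothing else here comes from the thread.

```xml
<cc_config>
  <log_flags>
    <!-- Write the client's work-fetch decisions to the Event Log -->
    <work_fetch_debug>1</work_fetch_debug>
  </log_flags>
</cc_config>
```

The flag is quite verbose, so it is probably worth setting it back to 0 once the segment showing the unwanted fetch has been captured.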
I figured out a reason for overfetch in my BOINC 7.1.18: | |
ID: 33269 | Rating: 0 | rate: / Reply Quote | |
"I figured out a reason for overfetch in my BOINC 7.1.18:" Yes, I do have GPUGRID set at 0.75 GPU via app_config, so I can get better utilization from my GTX690s. So, you think BOINC can't figure out how to properly feed the GPUs? Interesting perspective. Thanks for your thoughts, MrS. Regards, Rick | |
ID: 33271 | Rating: 0 | rate: / Reply Quote | |
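The 0.75 GPU setting Rick mentions would live in an app_config.xml in the GPUGRID project directory. What follows is a sketch under stated assumptions, not his actual file: the app name acemdlong is a guess at GPUGRID's long-run application and should be checked against client_state.xml, and the cpu_usage value is purely illustrative because the thread does not give one. The max_concurrent of 4 matches the "no more than 4 WUs at a time" from the first post.

```xml
<app_config>
  <app>
    <!-- Assumed app name; check client_state.xml for the real one -->
    <name>acemdlong</name>
    <!-- Run at most 4 tasks of this app at once -->
    <max_concurrent>4</max_concurrent>
    <gpu_versions>
      <!-- Each task is budgeted 0.75 of a GPU -->
      <gpu_usage>0.75</gpu_usage>
      <!-- Illustrative value, not from the thread -->
      <cpu_usage>1.0</cpu_usage>
    </gpu_versions>
  </app>
</app_config>
```

With gpu_usage at 0.75, two GPUGRID tasks would need 1.5 GPUs and so cannot share one device, while a task from another project budgeted at 0.25 or less can run alongside. Note that max_concurrent limits how many tasks run, not how many are downloaded.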
Rick, just curious: why are you setting 0.75? Any other project running besides GPU-Grid, like POEM in my case? | |
ID: 33310 | Rating: 0 | rate: / Reply Quote | |