Message boards : Number crunching : extremely high error rates
Author | Message |
---|---|
I am just noticing on the Project Status Page that, expect for GERARD_PLAYMOL_4B80IC6U, all tasks - long runs as well as short runs - have extremely high error rates, some of them even close to 70%. | |
ID: 47028 | Rating: 0 | rate: / Reply Quote | |
I am just noticing on the Project Status Page that, expect for GERARD_PLAYMOL_4B80IC6U, all tasks - long runs as well as short runs - have extremely high error rates, some of them even close to 70%.These error rates are slowly decreasing now, as the new applications are working fine. I guess this may have to do with the recent problem where for about 2 days, all tasks failed right away, and hence were reported back as invalidThat's exactly the only reason for these high error rates. - or are there any other reasons?Note that the error rates always start at high, as failed workunits are returned much faster than successful tasks. The another consequence of this is that the error rates always increase much faster than they decrease. | |
ID: 47038 | Rating: 0 | rate: / Reply Quote | |
the new applications are working fine. ... Well ... Since I have a PC that has GTX 970 alongside 2x GTX 660 Ti (SM3.0).... that means that I'm still failing a lot of tasks, until the app is fixed. So, the "918" app is not running fine... for me. I imagine I'm not the only one. | |
ID: 47041 | Rating: 0 | rate: / Reply Quote | |
Since I have a PC that has GTX 970 alongside 2x GTX 660 Ti (SM3.0).... that means that I'm still failing a lot of tasks, until the app is fixed.Who knows how long it could take. Perhaps you should exclude in your cc_config.xml those GPUs in the meantime. So, the "918" app is not running fine... for me.I copy my method here for you and everybody else: Copy the following to your clipboard: notepad c:\ProgramData\BOINC\cc_config.xml Press Windows key + R, then paste and press enter.If you see an empty file, copy and paste the following text: <cc_config>
<options>
<exclude_gpu>
<url>www.gpugrid.net</url>
<device_num>1</device_num>
<type>NVIDIA</type>
</exclude_gpu>
</options>
</cc_config> The value in the <device_num> section should be adapted to the given system.You can have as many <exclude_gpu> sections in your cc_config.xml as many GPUs you have to disable. If your cc_config.xml already has an <options> section then you should insert the section between the <exclude_gpu> and the </exclude_gpu> tags (including both) right after the <options> tag. Click file -> save and click [save]. If your BOINC manager is running, you should click Options -> read config files. Perhaps you should restart BOINC manager (stop the scientific applications upon exiting). | |
ID: 47058 | Rating: 0 | rate: / Reply Quote | |
Thanks. I'm well aware of <exclude_gpu>. I am the one that requested that David A imlement it into BOINC :) I'm directly responsible for its existence, originally requested to prevent certain apps from running on the primary GPU because they made my display laggy! | |
ID: 47064 | Rating: 0 | rate: / Reply Quote | |
Thanks. I'm well aware of <exclude_gpu>. I am the one that requested that David A imlement it into BOINC :) I'm directly responsible for its existence, originally requested to prevent certain apps from running on the primary GPU because they made my display laggy!I know, I intended this for the others you referred having the same problem. However, I'm not going to use it as a workaround to fix this server issue.This is not a server issue, this is a compiler issue. However it could be avoided by the server if it wouldn't send work for the hosts equipped with GTX 660Ti cards, which policy wouldn't filter out your host, as it has a GTX 970 too, so the server doesn't know about the lesser cards in it. Instead, the tasks will continue to error on my GTX 660 Ti GPUs, until MJH and staff step up to better identify and then fix the issues.I think it's an unnecessary display of protest, as there are enough unsupervised hosts to make the statistics worse. They've hinted at some bug, but did not give appropriate info for anybody to do anything to fix it... So what is the nature of the problem?I don't know, but it should be a nasty one, as GTX 670 & GTX 680 (both CC3.0) is working fine with the new app. | |
ID: 47068 | Rating: 0 | rate: / Reply Quote | |
I am seeking clarification about the CC3.0 SM3 GTX 660 Ti problem. | |
ID: 47070 | Rating: 0 | rate: / Reply Quote | |
Message boards : Number crunching : extremely high error rates