Advanced search

Message boards : Graphics cards (GPUs) : Units failing

Author Message
Ryan Munro
Send message
Joined: 6 Mar 18
Posts: 4
Credit: 73,669,663
RAC: 7,498
Level
Thr
Scientific publications
wat
Message 58929 - Posted: 15 Jun 2022 | 9:14:29 UTC

Can someone have a quick look and let me know the problem here, a few computed fine but most errored out.

https://www.gpugrid.net/results.php?userid=524374&offset=0&show_names=0&state=5&appid=

Also, I have just started the project again on another machine with an Nvidia card and most of the time the card is idle with some second or so long spikes every now and again, is that normal?

Thanks

Ian&Steve C.
Avatar
Send message
Joined: 21 Feb 20
Posts: 664
Credit: 4,815,767,994
RAC: 1,059
Level
Arg
Scientific publications
wat
Message 58930 - Posted: 15 Jun 2022 | 12:08:50 UTC - in response to Message 58929.

Can someone have a quick look and let me know the problem here, a few computed fine but most errored out.

https://www.gpugrid.net/results.php?userid=524374&offset=0&show_names=0&state=5&appid=

Also, I have just started the project again on another machine with an Nvidia card and most of the time the card is idle with some second or so long spikes every now and again, is that normal?

Thanks


likely failing because your GT1030 doesn't have enough GPU memory. these python tasks use a lot of VRAM. GT1030 is probably too weak to run these kinds of tasks unfortunately.

and yes it's normal to see that behavior with your RTX 3090. the app has intermittent GPU use
____________

Ryan Munro
Send message
Joined: 6 Mar 18
Posts: 4
Credit: 73,669,663
RAC: 7,498
Level
Thr
Scientific publications
wat
Message 58931 - Posted: 15 Jun 2022 | 13:32:26 UTC - in response to Message 58930.

Thanks, some other odd behaviour I see on the 3090 machine, it seems to start the WU at 2%, if I pause Boinc and restart later the units elapsed time resets to 0 and the percentage goes back to 2%?

Richard Haselgrove
Send message
Joined: 11 Jul 09
Posts: 1419
Credit: 3,513,664,410
RAC: 628,387
Level
Arg
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 58932 - Posted: 15 Jun 2022 | 14:04:24 UTC - in response to Message 58931.

The program goes through several stages. The first and second 1% stages are unpacking files from an archive, and don't need to be repeated - progress will move to 2% instantly.

The rest of the run involves the serious work, and the app doesn't work out exactly how far its progressed immediately. If you wait a few seconds or minutes (depending on the speed of the rest of the machine), it should jump back up to where it was before the restart, and continue from there in 0.98% increments.

Post to thread

Message boards : Graphics cards (GPUs) : Units failing

//