Advanced search

Message boards : Number crunching : ACEMD3 High error rates

Author Message
homer__simpsons
Send message
Joined: 17 Nov 15
Posts: 10
Credit: 101,825,520
RAC: 2,991,015
Level
Cys
Scientific publications
wat
Message 61952 - Posted: 24 Nov 2024 | 13:58:39 UTC

Looking at my host, over 15 task, 6 failed (40%):



I believe this is not expected. I started to see this with the new batch (from 2024-11-17?). I previously paused tasks for ACEMD3, but re-started to run them again.

Luckily they fail early in the compute so there is not too much wasted resources, but it should probably be investingated.

Host: https://www.gpugrid.net/show_host_detail.php?hostid=611890

Paul Forsdick
Send message
Joined: 21 Feb 09
Posts: 1
Credit: 19,955,865
RAC: 355,131
Level
Pro
Scientific publications
wat
Message 61953 - Posted: 24 Nov 2024 | 17:58:52 UTC - in response to Message 61952.

I have the same problem

Keith Myers
Send message
Joined: 13 Dec 17
Posts: 1343
Credit: 7,719,704,297
RAC: 12,123,735
Level
Tyr
Scientific publications
watwatwatwatwat
Message 61956 - Posted: 25 Nov 2024 | 8:04:55 UTC - in response to Message 61953.

I have the same problem

You do not have the same problem referenced in this thread since you've haven't run any acemd3 tasks.

All your errors are the ATMML tasks.

Post to thread

Message boards : Number crunching : ACEMD3 High error rates

//