Advanced search

Message boards : Number crunching : Error - Exit Status 176 on two GERARD WUs

Author Message
John C MacAlister
Send message
Joined: 17 Feb 13
Posts: 180
Credit: 144,701,536
RAC: 1,539
Level
Cys
Scientific publications
watwatwatwatwatwatwatwatwatwat
Message 42282 - Posted: 3 Dec 2015 | 11:34:10 UTC

I have experienced two failures on my AMD FX-8350 with two ASUS GTX 660 Ti GPUs. Can someone please help me determine the probable cause?

Thanks : errors shown below:


Name
1x2-GERARD_RSPW_CXCL12_DIMTRIM_PL-0-20-RND4216_0
Workunit
11340963
Created
2 Dec 2015 | 15:55:41 UTC
Sent
2 Dec 2015 | 17:16:40 UTC
Received
3 Dec 2015 | 9:30:32 UTC
Server state
Over
Outcome
Computation error
Client state
Compute error
Exit status
176 (0xb0) Unknown error number
Computer ID
242208
Report deadline
7 Dec 2015 | 17:16:40 UTC
Run time
58,056.40
CPU time
2,060.57
Validate state
Invalid
Credit
0.00
Application version
Long runs (8-12 hours on fastest card) v8.46 (cuda65)
Stderr output
<core_client_version>7.2.42</core_client_version>
<![CDATA[
<message>
process exited with code 176 (0xb0, -80)
</message>
<stderr_txt>
# SWAN Device 0 :
# Name : GeForce GTX 660 Ti
# ECC : Disabled
# Global mem : 2047MB
# Capability : 3.0
# PCI ID : 0000:03:00.0
# Device clock : 1110MHz
# Memory clock : 3304MHz
# Memory width : 192bit
# The simulation has become unstable. Terminating to avoid lock-up (1)
# Attempting restart (step 5440000)

</stderr_txt>
]]>


Name
e25s8_e10s8p1f684-GERARD_FXCXCL12_LIG_002_166_8921-0-1-RND1816_1
Workunit
11339248
Created
1 Dec 2015 | 23:06:58 UTC
Sent
1 Dec 2015 | 23:07:11 UTC
Received
2 Dec 2015 | 7:33:20 UTC
Server state
Over
Outcome
Computation error
Client state
Compute error
Exit status
176 (0xb0) Unknown error number
Computer ID
242208
Report deadline
6 Dec 2015 | 23:07:11 UTC
Run time
30,166.35
CPU time
3,449.24
Validate state
Invalid
Credit
0.00
Application version
Long runs (8-12 hours on fastest card) v8.46 (cuda65)
Stderr output
<core_client_version>7.2.42</core_client_version>
<![CDATA[
<message>
process exited with code 176 (0xb0, -80)
</message>
<stderr_txt>
# SWAN Device 0 :
# Name : GeForce GTX 660 Ti
# ECC : Disabled
# Global mem : 2047MB
# Capability : 3.0
# PCI ID : 0000:03:00.0
# Device clock : 1110MHz
# Memory clock : 3304MHz
# Memory width : 192bit
# Simulation unstable. Flag 10 value 686
# The simulation has become unstable. Terminating to avoid lock-up
# The simulation has become unstable. Terminating to avoid lock-up (2)
# Attempting restart (step 10460000)

</stderr_txt>
]]>

Betting Slip
Send message
Joined: 5 Jan 09
Posts: 669
Credit: 2,498,095,550
RAC: 0
Level
Phe
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 42284 - Posted: 3 Dec 2015 | 13:40:00 UTC - in response to Message 42282.

Overclocking too agressively

John C MacAlister
Send message
Joined: 17 Feb 13
Posts: 180
Credit: 144,701,536
RAC: 1,539
Level
Cys
Scientific publications
watwatwatwatwatwatwatwatwatwat
Message 42285 - Posted: 3 Dec 2015 | 14:31:53 UTC - in response to Message 42284.

Overclocking too agressively


My system builder says my GPUs are not overclocked.....

Betting Slip
Send message
Joined: 5 Jan 09
Posts: 669
Credit: 2,498,095,550
RAC: 0
Level
Phe
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 42286 - Posted: 3 Dec 2015 | 15:31:19 UTC - in response to Message 42285.

Overclocking too agressively


My system builder says my GPUs are not overclocked.....


Download GPUZ and see what the default clocks are and what current clocks are.

https://www.techpowerup.com/downloads/2571/techpowerup-gpu-z-v0-8-6/

Jacob Klein
Send message
Joined: 11 Oct 08
Posts: 1111
Credit: 1,813,512,539
RAC: 953,866
Level
His
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 42339 - Posted: 9 Dec 2015 | 3:27:45 UTC

Yeah, I'd say your GPU clock is too high.

I'd like to know the exact make and model of your GPU, as well as GPU-Z's values for "GPU Clock" and "Default Clock" while a task is being crunched.

John C MacAlister
Send message
Joined: 17 Feb 13
Posts: 180
Credit: 144,701,536
RAC: 1,539
Level
Cys
Scientific publications
watwatwatwatwatwatwatwatwatwat
Message 42348 - Posted: 9 Dec 2015 | 22:11:14 UTC - in response to Message 42339.

Hi, Jacob:

I am running under Linux Mint 17.2 and will have to look for a GPU-Z equivalent app.

Will be in touch later.

Thanks,

John

Post to thread

Message boards : Number crunching : Error - Exit Status 176 on two GERARD WUs