
watchdog does not work

Open Optimusjuju opened this issue 8 years ago • 10 comments

The miner crashes at random. I would like to know what your solution was?

WIN 10 15.12 AND 16.1.1

Optimusjuju avatar Nov 02 '16 22:11 Optimusjuju

add -f 2 -z 1 to your script

ofpcarnage avatar Nov 02 '16 23:11 ofpcarnage

Thank you.

Optimusjuju avatar Nov 02 '16 23:11 Optimusjuju

Yeah...same here. All machines have GPUs that drop to 0 S/s with no "auto-fix". Machines: Win10, 6xRX470. @ofpcarnage Adding -f 2 CAUSES EXIT on failure. Why the heck are you recommending that?

DefiantMiner avatar Nov 03 '16 01:11 DefiantMiner

-f 2 actually helps in that case; just run the miner in a loop. Something like this:

title MinerX
:start
genoil -c pool.whatever.com:XXXX -u wallet.worker -p x -g 0 1 2 X -f 2
timeout /T 5
goto start

(If necessary, add or change the platform switch: -P 0 or 1.)

mo35 avatar Nov 03 '16 01:11 mo35

I have it on a respawn timer but since it takes 3-4mins to just load the binaries for 6gpus...I'd be spending more time booting the miner than actually hashing.

DefiantMiner avatar Nov 03 '16 02:11 DefiantMiner

That bat file tends to crash after a few hours, so try this instead.

@echo off
:check
tasklist /FI "imagename eq genoil.exe" | find /I /N "genoil.exe" >NUL
if "%ERRORLEVEL%"=="1" start genoil.exe -c eu1-zcash.flypool.org:3333 -u Address.worker -p x -i 20 -w 64 -P 0 -k zec -f 2 -z 1
timeout /t 2 /nobreak
goto :check
pause

jjziets avatar Nov 03 '16 07:11 jjziets

That's my script. The problem is that genoil sometimes fails, and sometimes drops to 0 S/s without failing, so I restart it every 1200 seconds (20 minutes). I use one script per GPU so I can watch each one, and the commands start them minimized so they don't clutter the screen.

:restart
start /min "GPU0" cmd /C start.bat
TIMEOUT /T 3
start /min "GPU1" cmd /C start2.bat
TIMEOUT /T 3
start /min "GPU2" cmd /C start3.bat
TIMEOUT /T 1200
taskkill /f /im genoil.exe
TIMEOUT /T 3
goto restart

pelelademadera avatar Nov 03 '16 14:11 pelelademadera

I have a log file for my genoil 0.6. It is really random how often it restarts.

started genoil at Thu 11/03/2016 15:44:04.28
started genoil at Thu 11/03/2016 15:46:07.44
started genoil at Thu 11/03/2016 15:50:11.46
started genoil at Thu 11/03/2016 15:51:34.02
started genoil at Thu 11/03/2016 15:53:42.07
started genoil at Thu 11/03/2016 15:55:51.63
started genoil at Thu 11/03/2016 15:56:17.16
started genoil at Thu 11/03/2016 15:58:01.21
started genoil at Thu 11/03/2016 15:58:55.44
started genoil at Thu 11/03/2016 16:03:42.19
started genoil at Thu 11/03/2016 16:08:25.16
started genoil at Thu 11/03/2016 16:10:45.88
started genoil at Thu 11/03/2016 16:11:44.34
started genoil at Thu 11/03/2016 16:14:45.39
started genoil at Thu 11/03/2016 16:18:30.01
started genoil at Thu 11/03/2016 16:18:47.05
started genoil at Thu 11/03/2016 16:19:07.44
started genoil at Thu 11/03/2016 16:21:34.53
started genoil at Thu 11/03/2016 16:25:36.08
started genoil at Thu 11/03/2016 16:26:32.51
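To put a number on how irregular these restarts are, the gaps between consecutive entries can be computed. This is only a sketch in Python: it assumes each log line has the shape `started genoil at <day> <MM/DD/YYYY> <HH:MM:SS.cc>` as above, and `restart_intervals` is a hypothetical helper name, not part of genoil.

```python
from datetime import datetime

def restart_intervals(log_lines):
    """Return the seconds elapsed between consecutive 'started genoil at' lines."""
    stamps = []
    for line in log_lines:
        if not line.startswith("started genoil at"):
            continue  # ignore anything that is not a restart entry
        parts = line.split()
        # The last two fields are the date and the time; the day name is skipped
        # so that parsing does not depend on the system locale.
        stamp = datetime.strptime(parts[-2] + " " + parts[-1], "%m/%d/%Y %H:%M:%S.%f")
        stamps.append(stamp)
    return [(later - earlier).total_seconds()
            for earlier, later in zip(stamps, stamps[1:])]

# First three entries of the log above.
sample = [
    "started genoil at Thu 11/03/2016 15:44:04.28",
    "started genoil at Thu 11/03/2016 15:46:07.44",
    "started genoil at Thu 11/03/2016 15:50:11.46",
]
intervals = restart_intervals(sample)
print(intervals)
```

Feeding the whole log through the same function gives the full list of gaps, from which a minimum, maximum, or average can be taken.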


jjziets avatar Nov 03 '16 14:11 jjziets

Here's a log of a typical failure mode...6gpus drop to 3 over 10mins...then this happens:

22:03:32 main : zec-sa#0: 0.0S/s zec-sa#1: 25.3S/s zec-sa#2: 25.5S/s zec-sa#3: 0.0S/s zec-sa#4: 0.0S/s zec-sa#5: 21.1S/s total: 72.0S/s
22:03:35 gpu#1: submitting solution
22:03:35 pool : received new job #7260602f8bd574ca844e
22:03:35 pool : not accepted: Stale job! (21)
22:03:35 pool : Retrying in 5 seconds...
22:03:36 gpu#1: submitting solution
zec-sa#1write: The file handle supplied is not valid
22:03:36 gpu#1: Mining thread quit
22:03:37 gpu#5: submitting solution
zec-sa#5write: The file handle supplied is not valid
22:03:37 gpu#5: Mining thread quit
22:03:37 main : zec-sa#0: 0.0S/s zec-sa#1: 21.3S/s zec-sa#2: 26.9S/s zec-sa#3: 0.0S/s zec-sa#4: 0.0S/s zec-sa#5: 23.7S/s total: 72.0S/s
22:03:40 pool : Connecting to us1-zcash.flypool.org:3333
22:03:40 pool : Connected!
22:03:40 pool : subscribed
22:03:40 pool : received new target: 0x0083126e978d4f...
22:03:41 pool : received new job #6771047e7e2fdf94d7b8
22:03:42 main : 0.0S/s detected! Attempting to restart gpu#1
22:03:42 main : 0.0S/s detected! Attempting to restart gpu#5
22:03:42 main : zec-sa#0: 0.0S/s zec-sa#1: 0.0S/s zec-sa#2: 22.1S/s zec-sa#3: 0.0S/s zec-sa#4: 0.0S/s zec-sa#5: 0.0S/s total: 22.1S/s
22:03:47 main : zec-sa#0: 0.0S/s zec-sa#1: 0.0S/s zec-sa#2: 19.7S/s zec-sa#3: 0.0S/s zec-sa#4: 0.0S/s zec-sa#5: 0.0S/s total: 19.7S/s
22:03:50 gpu#2: submitting solution
22:03:50 pool : submitted and accepted
22:03:52 main : zec-sa#0: 0.0S/s zec-sa#1: 0.0S/s zec-sa#2: 24.1S/s zec-sa#3: 0.0S/s zec-sa#4: 0.0S/s zec-sa#5: 0.0S/s total: 24.1S/s
22:03:57 main : zec-sa#0: 0.0S/s zec-sa#1: 0.0S/s zec-sa#2: 21.3S/s zec-sa#3: 0.0S/s zec-sa#4: 0.0S/s zec-sa#5: 0.0S/s total: 21.3S/s
22:03:58 gpu#2: submitting solution
22:03:58 pool : submitted and accepted
22:04:02 main : zec-sa#0: 0.0S/s zec-sa#1: 0.0S/s zec-sa#2: 19.7S/s zec-sa#3: 0.0S/s zec-sa#4: 0.0S/s zec-sa#5: 0.0S/s total: 19.7S/s

It's on a respawn timer but limping along on ONE gpu for the rest of the period is killing my effective hash rate. Note the watchdogs activate...but do nothing.
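Since the built-in watchdog detects the stall but does not recover from it, an external script could read the same status lines and decide when to act. The following is a rough Python sketch, assuming the `main :` status line keeps the `zec-sa#N: X.XS/s` format shown in the log; `stalled_gpus` is a hypothetical helper, and what to do with the result (for example, killing and restarting the miner) is left out.

```python
import re

# Matches per-GPU entries such as "zec-sa#3: 0.0S/s". The trailing "total:" field
# and the "zec-sa#1write:" error lines do not match this pattern.
STATUS_RE = re.compile(r"zec-sa#(\d+): ([\d.]+)S/s")

def stalled_gpus(status_line):
    """Return the GPU indices reported at 0 S/s in one status line."""
    return [int(index)
            for index, rate in STATUS_RE.findall(status_line)
            if float(rate) == 0.0]

line = ("22:03:42 main : zec-sa#0: 0.0S/s zec-sa#1: 0.0S/s zec-sa#2: 22.1S/s "
        "zec-sa#3: 0.0S/s zec-sa#4: 0.0S/s zec-sa#5: 0.0S/s total: 22.1S/s")
print(stalled_gpus(line))  # prints [0, 1, 3, 4, 5]
```

A wrapper might restart the miner only when the same index stays in this list for several consecutive status lines, to avoid reacting to a single slow sample.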

Hope this helps diagnose and fix.

Win 10, 6xRX470s, 1840, 4GB, 16.10.1 drivers

DefiantMiner avatar Nov 04 '16 03:11 DefiantMiner

@pelelademadera Stability is much better when running 6 separate processes (one per GPU) than when running a single process with all 6 GPUs. The binary load time is instant with separate processes as well. So, to maximize output, I have the script re-spawning all 6 processes every 15 minutes to catch any strays that may drop off. This seems to be the only way to maximize multi-GPU output with the current software.
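The one-process-per-GPU respawn scheme described here could look roughly like the Python sketch below. It is not the poster's actual script: the flags `-c`, `-u`, `-p`, `-g` and `-f 2` are copied from the batch files earlier in the thread, the pool and wallet values are the same placeholders, passing a single index to `-g` for each process is an assumption, and `build_command` and `run_cycle` are hypothetical names.

```python
import subprocess
import time

def build_command(gpu_index, pool="pool.whatever.com:XXXX", user="wallet.worker"):
    """Build the command line for one miner process bound to a single GPU."""
    # Assumption: "-g <index>" restricts this process to one GPU.
    return ["genoil.exe", "-c", pool, "-u", user, "-p", "x",
            "-g", str(gpu_index), "-f", "2"]

def run_cycle(gpu_indices, period_s, spawn=subprocess.Popen, sleep=time.sleep):
    """Start one process per GPU, wait period_s seconds, then stop them all."""
    procs = [spawn(build_command(index)) for index in gpu_indices]
    sleep(period_s)
    for proc in procs:
        proc.terminate()  # also stops processes that are alive but stuck at 0 S/s
    for proc in procs:
        proc.wait()       # make sure each process has exited before respawning
    return len(procs)

# Intended use (not executed here): respawn all six miners every 15 minutes.
# while True:
#     run_cycle(range(6), 15 * 60)
```

The `spawn` and `sleep` parameters exist only so the cycle can be tried out with stand-ins before pointing it at real hardware.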

DefiantMiner avatar Nov 04 '16 20:11 DefiantMiner