lolMiner-releases icon indicating copy to clipboard operation
lolMiner-releases copied to clipboard

KASPA only mining on Nvidia cards crashing and rebooting

Open mr-pineapple-nz opened this issue 2 years ago • 3 comments

Hi,

Started to mine KAS only on Nvidia cards. After 7-8 hours it crashes and reboot. Not even using OC.

lolminer: 1.62 pool: pool.au.woolypooly.com:3112 nvidia driver: 510.68.02 OS: hiveOS 0.6-219@221115 GPU: RTX 3060, RTX 2080 Ti

Checked the logs and not much help in there:

tail -n 30 ./lolminer_reboot.log
Unrecoverable error by GPU 0.
Please check your OC & UV settings on this card.
Unrecoverable error by GPU 1.
Please check your OC & UV settings on this card.
Average speed (10s): 0.00 Mh/s | 0.00 Mh/s Total: 0.00 Mh/s
DNS over HTTPS resolve failed - switching to standard resolve
Device 0 detected as crashed.
-----------------------------------------------
Statistics (15:33:18); Uptime: 3h 14m 6s
lolMiner 1.62, Nvidia 510.68.02, Api port 44444
Mining: HeavyHash-Kaspa
Connected to:  (21ms latency)

      Name        Speed     Pool  Shares   Best    Eff.
                   Mh/s     Mh/s     A/R  Share  Mh/s/W
GPU 0 RTX 3060     0.00   327.49    88/0  22.5T   0.000
GPU 1 RTX 2080 Ti  0.00   742.44   198/1  24.4T   0.000
---------------------------
Total              0.00  1069.93   286/1  24.4T   0.000

        Power  CCLK  MCLK  Core  Junc   Mem  Fan
            W   MHz   MHz  Temp  Temp  Temp  Pct
GPU 0    42.4  1882  6300    28    38   N/A   50
GPU 1    61.1  1350  5799    34    48    47   75
---------------------------
Total   103.5
-----------------------------------------------
Device 1 detected as crashed.
Closing miner and trying to call external script: ./emergency.sh (--watchdog script)

Thanks in advance,

mr-pineapple-nz avatar Nov 21 '22 06:11 mr-pineapple-nz

Hi,

Could you try to run the miner with that extra parameters:

--cclk 1740, 1350 --mclk 810

That will reduce the Watts and fix the Core to avoid any crash of OC in the GPU1

jgonzis avatar Nov 22 '22 10:11 jgonzis

thanks, testing it....

mr-pineapple-nz avatar Nov 22 '22 22:11 mr-pineapple-nz

Also please check out if the new codes of 1.63 maybe fix this issue.

Lolliedieb avatar Nov 23 '22 13:11 Lolliedieb

Hi,

Could you try to run the miner with that extra parameters:

--cclk 1740, 1350 --mclk 810

That will reduce the Watts and fix the Core to avoid any crash of OC in the GPU1

I've tried this OC and it roughly crashed 3 times in the last 24 hours...a bit less than before.

The version 1.63 is not yet available under hiveos

mr-pineapple-nz avatar Nov 24 '22 19:11 mr-pineapple-nz

Now is avalaible... still remains the problem?

jgonzis avatar Nov 26 '22 15:11 jgonzis

Now is avalaible... still remains the problem?

Testing it for 2 days now. The crashing && rebooting seems to be gone, so it seems the new version fixed that. There was a 40min drop in the has rate today but I haven't checked the logs yet. I think you can close this issue.

Thanks for the support!

mr-pineapple-nz avatar Nov 27 '22 10:11 mr-pineapple-nz

I was too quick, the miner crashed overnight and rebooted. However, now it did after running for 2 days...i will try to dig the logs

mr-pineapple-nz avatar Nov 27 '22 18:11 mr-pineapple-nz

here is the log, it seems at some point it cannot resolve pool DNS

Connected to:  (76ms latency)

      Name         Speed    Pool  Shares   Best    Eff.
                    Mh/s    Mh/s     A/R  Share  Mh/s/W
GPU 0 RTX 3060    310.67  341.22   384/1   3.9T   4.592
GPU 1 RTX 2080 Ti 590.03  656.57   741/3  94.3T   7.305
---------------------------
Total             900.70  997.80  1125/4  94.3T   6.068

        Power  CCLK  MCLK  Core  Junc   Mem  Fan
            W   MHz   MHz  Temp  Temp  Temp  Pct
GPU 0    67.2  1740   810    32    42   N/A   50
GPU 1    80.3  1350   810    39    52    51   75
---------------------------
Total   147.4
-----------------------------------------------
Average speed (10s): 0.00 Mh/s | 0.00 Mh/s Total: 0.00 Mh/s
DNS resolve error - retrying in 5 seconds
Average speed (10s): 0.00 Mh/s | 0.00 Mh/s Total: 0.00 Mh/s
DNS over HTTPS resolve failed - switching to standard resolve
Average speed (10s): 0.00 Mh/s | 0.00 Mh/s Total: 0.00 Mh/s
Average speed (10s): 0.00 Mh/s | 0.00 Mh/s Total: 0.00 Mh/s
DNS resolve error - retrying in 5 seconds
Too many attempts. Switching to failover pool pool.woolypooly.com:3112
DNS over HTTPS resolve failed - switching to standard resolve
Unrecoverable error by GPU 1.
Please check your OC & UV settings on this card.
Unrecoverable error by GPU 0.
Please check your OC & UV settings on this card.
Average speed (10s): 0.00 Mh/s | 0.00 Mh/s Total: 0.00 Mh/s
DNS over HTTPS resolve failed - switching to standard resolve
Device 0 detected as crashed.
-----------------------------------------------
Statistics (17:18:51); Uptime: 7h 2m 32s
lolMiner 1.63, Nvidia 510.68.02, Api port 44444
Mining: HeavyHash-Kaspa
Connected to:  (76ms latency)

      Name        Speed    Pool  Shares   Best    Eff.  Power  CCLK  MCLK  Core  Junc   Mem  Fan
                   Mh/s    Mh/s     A/R  Share  Mh/s/W      W   MHz   MHz  Temp  Temp  Temp  Pct
GPU 0 RTX 3060     0.00  339.99   384/1   3.9T   0.000   23.4  1740   810    25    36   N/A   50
GPU 1 RTX 2080 Ti  0.00  654.19   741/3  94.3T   0.000   19.0  1350   810    29    43    42   75
---------------------------
Total              0.00  994.18  1125/4  94.3T   0.000   42.4
-----------------------------------------------
Device 1 detected as crashed.
Closing miner and trying to call external script: ./emergency.sh (--watchdog script)
Closing miner and trying to call external script: ./emergency.sh (--watchdog script)
Average speed (10s): 0.00 Mh/s | 0.00 Mh/s Total: 0.00 Mh/s

mr-pineapple-nz avatar Nov 28 '22 05:11 mr-pineapple-nz

Yea, woolypool had some issues on the weekend - seems they are on a high load.

Mr.Pineapple @.***> schrieb am Mo., 28. Nov. 2022, 06:46:

here is the log, it seems at some point it cannot resolve pool DNS

Connected to: (76ms latency)

  Name         Speed    Pool  Shares   Best    Eff.
                Mh/s    Mh/s     A/R  Share  Mh/s/W

GPU 0 RTX 3060 310.67 341.22 384/1 3.9T 4.592 GPU 1 RTX 2080 Ti 590.03 656.57 741/3 94.3T 7.305

Total 900.70 997.80 1125/4 94.3T 6.068

    Power  CCLK  MCLK  Core  Junc   Mem  Fan
        W   MHz   MHz  Temp  Temp  Temp  Pct

GPU 0 67.2 1740 810 32 42 N/A 50 GPU 1 80.3 1350 810 39 52 51 75

Total 147.4

Average speed (10s): 0.00 Mh/s | 0.00 Mh/s Total: 0.00 Mh/s DNS resolve error - retrying in 5 seconds Average speed (10s): 0.00 Mh/s | 0.00 Mh/s Total: 0.00 Mh/s DNS over HTTPS resolve failed - switching to standard resolve Average speed (10s): 0.00 Mh/s | 0.00 Mh/s Total: 0.00 Mh/s Average speed (10s): 0.00 Mh/s | 0.00 Mh/s Total: 0.00 Mh/s DNS resolve error - retrying in 5 seconds Too many attempts. Switching to failover pool pool.woolypooly.com:3112 DNS over HTTPS resolve failed - switching to standard resolve Unrecoverable error by GPU 1. Please check your OC & UV settings on this card. Unrecoverable error by GPU 0. Please check your OC & UV settings on this card. Average speed (10s): 0.00 Mh/s | 0.00 Mh/s Total: 0.00 Mh/s DNS over HTTPS resolve failed - switching to standard resolve Device 0 detected as crashed.

Statistics (17:18:51); Uptime: 7h 2m 32s lolMiner 1.63, Nvidia 510.68.02, Api port 44444 Mining: HeavyHash-Kaspa Connected to: (76ms latency)

  Name        Speed    Pool  Shares   Best    Eff.  Power  CCLK  MCLK  Core  Junc   Mem  Fan
               Mh/s    Mh/s     A/R  Share  Mh/s/W      W   MHz   MHz  Temp  Temp  Temp  Pct

GPU 0 RTX 3060 0.00 339.99 384/1 3.9T 0.000 23.4 1740 810 25 36 N/A 50 GPU 1 RTX 2080 Ti 0.00 654.19 741/3 94.3T 0.000 19.0 1350 810 29 43 42 75

Total 0.00 994.18 1125/4 94.3T 0.000 42.4

Device 1 detected as crashed. Closing miner and trying to call external script: ./emergency.sh (--watchdog script) Closing miner and trying to call external script: ./emergency.sh (--watchdog script) Average speed (10s): 0.00 Mh/s | 0.00 Mh/s Total: 0.00 Mh/s

— Reply to this email directly, view it on GitHub https://github.com/Lolliedieb/lolMiner-releases/issues/1784#issuecomment-1328572689, or unsubscribe https://github.com/notifications/unsubscribe-auth/AJS63R5LUHORT24NR2KDVFLWKRBMZANCNFSM6AAAAAASGIVSW4 . You are receiving this because you commented.Message ID: @.***>

Lolliedieb avatar Nov 28 '22 06:11 Lolliedieb

I think you can close this issue. I also changed my OC config.

Btw, why I cannot override the mclk in the parameters? I tried to change it other than 810 but on the stats it seems not changing...like hard coded.

mr-pineapple-nz avatar Feb 26 '23 03:02 mr-pineapple-nz

Like --mclk 810 and admin

jgonzis avatar Feb 26 '23 08:02 jgonzis