Retrieval-based-Voice-Conversion-WebUI icon indicating copy to clipboard operation
Retrieval-based-Voice-Conversion-WebUI copied to clipboard

RuntimeError: [../third_party/gloo/gloo/transport/tcp/pair.cc:534] Connection closed by peer [127.0.0.1]:10895这是什么错误,该如何解决

Open ymmbb8882ymmbb opened this issue 11 months ago • 2 comments

RuntimeError: [../third_party/gloo/gloo/transport/tcp/pair.cc:534] Connection closed by peer [127.0.0.1]:10895这是什么错误,该如何解决

ymmbb8882ymmbb avatar Mar 11 '24 09:03 ymmbb8882ymmbb

我遇到类似的问题,训练过程中这一行代码产生了报错:

scaler.scale(loss_gen_all).backward()

报错信息如下:

RuntimeError: [../third_party/gloo/gloo/transport/tcp/pair.cc:589] Read error [192.168.70.92]:25266: Connection reset by peer

Yaodada12 avatar Apr 10 '24 06:04 Yaodada12

我遇到类似的问题,训练过程中这一行代码产生了报错:

scaler.scale(loss_gen_all).backward()

报错信息如下:

RuntimeError: [../third_party/gloo/gloo/transport/tcp/pair.cc:589] Read error [192.168.70.92]:25266: Connection reset by peer

@RVC-Boss 而且正常是2分钟200 step,但是报错之前跑200个step用了2小时,这是什么鬼。

2024-04-10 04:59:03 | INFO | logs | Total loss at epoch 5, step at 58400, loss=31.05100440979004
2024-04-10 05:01:07 | INFO | logs | Total loss at epoch 5, step at 58600, loss=33.73411178588867
2024-04-10 05:03:10 | INFO | logs | Total loss at epoch 5, step at 58800, loss=32.79433822631836
2024-04-10 07:05:01 | INFO | logs | Total loss at epoch 5, step at 59000, loss=35.7524299621582

Yaodada12 avatar Apr 10 '24 06:04 Yaodada12