GPT2-Chinese icon indicating copy to clipboard operation
GPT2-Chinese copied to clipboard

多gpu报错:Input tensor at index 2 has invalid shape

Open tuzeao opened this issue 2 years ago • 0 comments

python3.8,环境同requirements.txt 当使用多个gpu训练时候,会报错 RuntimeError: Input tensor at index 2 has invalid shape [2, 10, 12, 1024, 64], but expected [2, 11, 12, 1024, 64]

出错行显示在模型推理那行 outputs = model.forward(input_ids=batch_inputs, labels=batch_inputs)

这就很奇怪了,明明input_ids和labels完全是一个东西,却会报shape invalid

尝试性用了单gpu,发现代码能跑了

试了一些常规方法,没发现问题原因

tuzeao avatar Dec 26 '22 08:12 tuzeao