lukas Wang
lukas Wang
是怎么指定多卡的
> @LiinXemmon Hi, this is caused by log(0) which will return `inf`, I think you should a very small value to difference of two sentences' reward(like 1e-7), it will help...
Hi, I have same problem here. For my case, during the training stage, there is an error saying that it expected all tensors to be on the same device.
Still got the error saying that `it expected all tensors to be on the same device.cuda:0, cuda:1`
Same with MOSS model here. 2X slower