1luik

Results 10 comments of 1luik

FakeTensor(..., device='cuda:0', size=(1, s17, s6), dtype=torch.float16, requires_grad=True) ), GradTrackingTensor(lvl=1, value= FakeTensor(..., device='cuda:0', size=(1024, 101306), dtype=torch.float16) )), **{}): got RuntimeError('a and b must have same reduction dim, but got [s17, s6]...

> 你好 [@1luik](https://github.com/1luik)如果您希望我们进行故障排除,我们需要一个更简化的脚本。一般来说,虽然 Unsloth 的 GRPOTrainer 与 trl 的界面类似,但它们的运行方式有所不同。您添加了很多自定义设置,因此需要找出是哪个自定义设置导致了问题。我建议您首先尝试移除尽可能多的自定义设置,看看 Unsloth 是否能正常工作,然后再逐个添加回去。 No need to rush to fix it; if you don't add import unsloth in the first line, there won't...

I'm extremely frustrated that this bug requires me to restart the server multiple times