Yongrui Heng
Yongrui Heng
Hi, have you solved the issue? This PR works for me: https://github.com/volcengine/verl/pull/3653
I don't use docker image. I install from custom environment: ``` flashinfer-python==0.2.9rc2 torch==2.7.1 sgl-kernel==0.2.8 sglang==0.4.10.post2 torch_memory_saver==0.0.8 torchao==0.9.0 torchaudio==2.7.1 torchdata==0.11.0 torchvision==0.22.1 xformers==0.0.31 xgrammar==0.1.21 vllm==0.10.1.1 transformers==4.55.4 ``` It might be that my...
Yes, I let the training finish. It seems that the error is still related to https://github.com/volcengine/verl/pull/3653?