PROoshio

Results 1 issues of PROoshio

### Reminder - [X] I have read the README and searched the existing issues. ### System Info 尝试Yi和Qwen2-1.5b模型都存在这个问题 train_batchsize=64 per device batch size=1 / 2 / 4 (设置不同gradient accumulate step保证train_batchsize=64)...

pending