Yaowei Zheng

Results 651 comments of Yaowei Zheng

Try reinstall trl

We recommend using EasyR1 for RL, the RL implementation in LlamaFactory is temporarily bugged: https://github.com/hiyouga/EasyR1

这个应该影响不大吧,去除会有问题

训练时候没有添加 `resize_vocab` 参数,tokenizer 不识别

@lifeng7777 modules_to_save 里面不是有吗