Pattaro comments

Repositories
Issues
Comments

Results 5 comments of


                                            Pattaro

有计划支持ReMax吗？

https://github.com/liziniu/ReMax

step3_rlhf_finetuning may needs two tokenizers ?

> Using the same tokenizer for actor and critic in step3 is beneficial. Considering that RM model is easier to train, in step2, I try to use the actor tokenizer...

有计划支持 KTO 吗？

确实很心动，期待作者集成～

Cannot load the previous model weights when using ZeRO 3 optimizer in DeepSpeed Chat

How to solve this problem

Issues with using the released hh dataset.

我也遇到了你解决了吗