Shipeng Wang
Hi, in your paper you said the "next-utterance classification" and "language modeling" tasks were trained in a multi-task learning setting, and in train.py there is a function load...
I am currently running experiments with the DPO and KTO Trainers on a private dataset. I am considering using gradient checkpointing to reduce memory usage during backpropagation, but I am unsure...
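A minimal config sketch, assuming the TRL trainers are used: gradient checkpointing can be switched on through the training arguments (`DPOConfig` subclasses `transformers.TrainingArguments`, which exposes these fields); `use_reentrant=False` is the variant commonly recommended when combining checkpointing with adapters. This is illustrative, not a complete training script.

```python
from trl import DPOConfig

# Trade compute for memory: recompute activations during the backward pass
# instead of storing them. The non-reentrant implementation tends to play
# better with PEFT/LoRA setups.
training_args = DPOConfig(
    output_dir="dpo-output",
    gradient_checkpointing=True,
    gradient_checkpointing_kwargs={"use_reentrant": False},
)
```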
While training qwen1.5-14b-chat with transformers==4.38.2, I hit the following error: RuntimeError( "Unsloth: Tokenizer's pad_token cannot be = eos_token, and we couldn't find a\n"\ "replacement of either
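A minimal sketch of what the check above appears to complain about, using a stand-in class rather than a real tokenizer (assumption: the `TokSpec` class and `pad_token_ok` helper are hypothetical illustrations, not Unsloth code; the real fix uses the transformers tokenizer API shown in the comments).

```python
class TokSpec:
    """Hypothetical stand-in for the two special-token fields the check inspects."""
    def __init__(self, pad_token, eos_token):
        self.pad_token = pad_token
        self.eos_token = eos_token

def pad_token_ok(tok):
    # The error fires when pad_token is missing or aliases eos_token:
    # if padding and end-of-sequence share one id, loss masking on padding
    # would also mask genuine end-of-sequence positions.
    return tok.pad_token is not None and tok.pad_token != tok.eos_token

# pad aliased to eos -> the RuntimeError case
assert not pad_token_ok(TokSpec("<|endoftext|>", "<|endoftext|>"))

# With a real tokenizer, registering a distinct pad token resolves it, e.g.:
#   tokenizer.add_special_tokens({"pad_token": "<pad>"})
#   model.resize_token_embeddings(len(tokenizer))
assert pad_token_ok(TokSpec("<pad>", "<|endoftext|>"))
```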