Allan Jie comments


Results: 72 comments by Allan Jie

[Bug] DeepSpeed Inference Does not Work with LLaMA (Latest version)

Yes, removing the `--use_kernel` flag makes it work. And yes, I'm aware of DeepSpeed FastGen. I'm wondering how it supports batching: does it accept a batch size directly, or should I simply loop over the inputs myself?

Is PPO really better than SFT (in general), given the same amount of data?

I'm confused about one thing: when you run PPO without SFT, for example on Narrative QA, how does the (quite small) model know that it should generate an answer?