self-rewarding-lm-pytorch

What changes should I make to apply the method on Llama2?

Open Labmem009 opened this issue 11 months ago • 0 comments

I want to apply the Self-Rewarding and SPIN methods to Llama 2 using Alpaca-style fine-tuning datasets. What changes do I need to make, and what config should I use? Thanks a lot!

Labmem009 Feb 29 '24 11:02