self-rewarding-lm-pytorch What changes should I make to apply the method on Llama2?

What changes should I make to apply the method on Llama2?

Open Labmem009 opened this issue 11 months ago • 0 comments

I want to apply Self-rewarding and SPIN method on llama2 with alpaca-like finetuning datasets. What changes should I make to apply the method? And what config should I use? Thanks a lot!

Feb 29 '24 11:02 Labmem009

self-rewarding-lm-pytorch self-rewarding-lm-pytorch copied to clipboard

What changes should I make to apply the method on Llama2?

self-rewarding-lm-pytorch
self-rewarding-lm-pytorch copied to clipboard