
Is SFT fine-tuning available for the zephyr-7B family?

Open daehuikim opened this issue 1 year ago • 3 comments

[Image: evaluation loss curves]

Hello, I am very impressed by your models. I tried fine-tuning them on my own data, but the evaluation loss does not decrease as expected, as shown in the image above. The blue line is the llama-13b model, and you can see that the zephyr models perform worse than the llama model during fine-tuning, even though their MT performance is much better. Do you have any idea why this happens? My script is based on the basic SFTTrainer example in the trl library: https://github.com/huggingface/trl/blob/main/examples/scripts/sft.py

Thank you!

daehuikim, Oct 31 '23 01:10