
Access to SFT dataset or LLaMA2 SFT models

Open: YihanCao123 opened this issue 1 year ago • 1 comment

Hi authors,

First of all, thanks for your great work on LLaMA-2! This is an impressive contribution to open-source large language models!

I have a question about Section 3.1 of the paper, specifically the "Quality Is All You Need" part. It mentions that, when instruction-tuning the base model, you first selected 27,540 high-quality data examples. Would it be possible to open-source this selected data, or the supervised fine-tuned model that does not include RLHF?

Thanks!

YihanCao123, Jul 25 '23 21:07

+1. Both the data and the SFT model would be very useful for researchers.

a-antoniades, Nov 06 '23 20:11