Access to SFT dataset or LLaMA2 SFT models
Hi authors,
First of all, thanks for your great work on LLaMA-2! This is an impressive contribution to open-source large language models!
I have a question about Section 3.1 of the paper, specifically the "Quality Is All You Need" part. It mentions that when instruction-tuning the base model, you first selected 27,540 high-quality examples. Would it be possible to open-source this selected data, or the supervised fine-tuned (SFT) model without RLHF applied?
Thanks!
+1, both the data and the SFT model would be very useful for researchers.