llama icon indicating copy to clipboard operation
llama copied to clipboard

where is the train file?

Open liukaiyueyuo opened this issue 1 year ago • 1 comments

where is the train file? I want to learn how to train.

liukaiyueyuo avatar Mar 22 '23 07:03 liukaiyueyuo

@liukaiyueyuo hey any luck finding the pretraining docs and scripts ? i am looking forward to actually pretrain the 7B model on my own data on some cloud TPUs would be helpful to find the pretraining scripts

StephennFernandes avatar Mar 28 '23 06:03 StephennFernandes

also interested in training data composition, as it was not mentioned in technical report :)

bokesyo avatar Jul 26 '23 14:07 bokesyo

there's a finetuning script at https://github.com/facebookresearch/llama-recipes/blob/main/llama_finetuning.py which you could adapt for pretraining. Section 2 of the paper (https://arxiv.org/pdf/2307.09288.pdf) has the hyperparams used for pretraining

subramen avatar Sep 06 '23 17:09 subramen