llama-recipes Add support to continue pratraining

Add support to continue pratraining

Open hanyin88 opened this issue 1 year ago • 2 comments

🚀 The feature, motivation and pitch

Currently the LLaMA-receipts do not support pretraining out of box. For example, one could not save the pretrained model based on the current checkpoint saving logic which requires run_validation = True, which is not needed during continued pretraining phase. It would be nice to add this feature down the road as it's a common use case in the process of domain adaption.

Thanks.

Alternatives

No response

Additional context

No response

Nov 12 '23 05:11 hanyin88

@hanyin88 Did you find anything for this elsewhere?

Nov 27 '23 13:11 thisisisheanesu

@hanyin88 the original focus of this repo was about fine-tuning, so it has not been designed to support pertaining, but this is something we are investigating as well.

Feb 26 '24 00:02 HamidShojanazeri

llama-recipes llama-recipes copied to clipboard

Add support to continue pratraining

🚀 The feature, motivation and pitch

Alternatives

Additional context

llama-recipes
llama-recipes copied to clipboard