llama-recipes icon indicating copy to clipboard operation
llama-recipes copied to clipboard

Add support to continue pratraining

Open hanyin88 opened this issue 1 year ago • 2 comments

🚀 The feature, motivation and pitch

Currently the LLaMA-receipts do not support pretraining out of box. For example, one could not save the pretrained model based on the current checkpoint saving logic which requires run_validation = True, which is not needed during continued pretraining phase. It would be nice to add this feature down the road as it's a common use case in the process of domain adaption.

Thanks.

Alternatives

No response

Additional context

No response

hanyin88 avatar Nov 12 '23 05:11 hanyin88

@hanyin88 Did you find anything for this elsewhere?

thisisisheanesu avatar Nov 27 '23 13:11 thisisisheanesu

@hanyin88 the original focus of this repo was about fine-tuning, so it has not been designed to support pertaining, but this is something we are investigating as well.

HamidShojanazeri avatar Feb 26 '24 00:02 HamidShojanazeri