metaseq: How to finetune from a consolidated model?

Open • GongZhengLi opened this issue 1 year ago • 1 comment

There are ways to reshard a trained model into an inference model, but how do I retrain (finetune) starting from a consolidated model (like LLaMA)?

Apr 04 '23 11:04 GongZhengLi

You can convert the consolidated model offline into as many shards as you like using reshard_consolidated.py, then start finetuning from the resulting shards.
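For intuition only, here is a minimal sketch of what splitting a consolidated checkpoint into equal shards can look like. It is not the metaseq script and does not use its API: the helper names `shard_flat_params` and `consolidate` are hypothetical, parameters are modelled as a flat Python list rather than tensors, and the zero-padding scheme is an assumption about how even shard sizes are usually obtained.

```python
import math


def shard_flat_params(flat_params, num_shards):
    """Split a flat list of parameter values into num_shards equal-length chunks.

    Hypothetical helper for illustration. The tail is zero-padded so that
    every shard has the same length. Returns (shards, pad), where pad is the
    number of padding values appended.
    """
    if num_shards < 1:
        raise ValueError("num_shards must be >= 1")
    shard_len = math.ceil(len(flat_params) / num_shards)
    pad = shard_len * num_shards - len(flat_params)
    padded = list(flat_params) + [0.0] * pad
    shards = [
        padded[i * shard_len:(i + 1) * shard_len] for i in range(num_shards)
    ]
    return shards, pad


def consolidate(shards, pad):
    """Inverse of shard_flat_params: concatenate shards and drop the padding."""
    flat = [x for shard in shards for x in shard]
    return flat[:len(flat) - pad] if pad else flat


# Stand-in for a consolidated model: ten parameter values in one flat list.
consolidated = [float(i) for i in range(10)]
shards, pad = shard_flat_params(consolidated, num_shards=4)
```

Because the split is reversible, the same consolidated weights can be resharded to whatever shard count the finetuning job needs.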

Apr 25 '23 16:04 zycalice