
MPT-7B Finetuning Jupyter notebook request

Open GeorvityLabs opened this issue 2 years ago • 3 comments

@vchiley @samhavens @alextrott16, I was going through the MPT-7B fine-tuning documentation. It is definitely well written, but quite hard to grasp at first look.

Therefore, I am requesting a fine-tuning Jupyter notebook that folks can use to train on their local GPU or on paid cloud GPUs (A100s, etc.).

It would be great to have the whole workflow as a Jupyter notebook: from loading the MPT model, to loading an instruction set (or a combination of instruction sets), to finally running the fine-tuning.

GeorvityLabs • May 13 '23 04:05

@bmosaicml @coryMosaicML @nik-mosaic @abhi-mosaic I hope you can take a look at this.

GeorvityLabs • May 15 '23 10:05

@GeorvityLabs I'm putting together a much more detailed example to illustrate the finetuning workflow. The current plan does not include a notebook, though.

We might opt to set up example notebooks in the future, but I think that will have to be a later decision. We'll certainly consider this request when deciding how to curate examples.

alextrott16 • May 16 '23 01:05

@alextrott16 I hope you guys can write a blog post on how to fine-tune MPT-7B on datasets like Alpaca, Dolly, and OASST. It would be a great help!

GeorvityLabs • May 16 '23 08:05

One can try out Q Blocks GPUs if one does not have a local setup or Colab Pro.

souvik0306 • May 19 '23 15:05

I am going to close this issue for now, but the request for a notebook has been noted. We've added a more concrete, runnable finetuning example that partially addresses this.

alextrott16 • May 23 '23 02:05

Also ACKing the request for a blog post.

samhavens • May 23 '23 03:05

@alextrott16 yup, a blog post regarding instruction fine-tuning would be great.

GeorvityLabs • May 23 '23 03:05
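
The workflow requested in this thread (load one or more instruction sets, then fine-tune) usually starts by getting datasets such as Alpaca and Dolly into one common shape. Below is a minimal sketch of that data-preparation step only. It assumes Alpaca-style records with `instruction`/`input`/`output` fields and Dolly-style records with `instruction`/`context`/`response` fields, and it targets `prompt`/`response` pairs. The function names and the prompt template are hypothetical and are not taken from llm-foundry; check the finetuning example mentioned above for the format the training code actually expects.

```python
from typing import Dict, List

# Hypothetical prompt templates, chosen for illustration only.
TEMPLATE_WITH_CONTEXT = (
    "### Instruction:\n{instruction}\n\n### Input:\n{context}\n\n### Response:\n"
)
TEMPLATE_NO_CONTEXT = "### Instruction:\n{instruction}\n\n### Response:\n"


def to_prompt_response(record: Dict[str, str]) -> Dict[str, str]:
    """Map one Alpaca- or Dolly-style record to a prompt/response pair."""
    instruction = record["instruction"].strip()
    # Alpaca stores extra context under "input"; Dolly uses "context".
    context = (record.get("input") or record.get("context") or "").strip()
    # Alpaca stores the answer under "output"; Dolly uses "response".
    answer = (record.get("output") or record.get("response") or "").strip()

    if context:
        prompt = TEMPLATE_WITH_CONTEXT.format(
            instruction=instruction, context=context
        )
    else:
        prompt = TEMPLATE_NO_CONTEXT.format(instruction=instruction)
    return {"prompt": prompt, "response": answer}


def combine(*datasets: List[Dict[str, str]]) -> List[Dict[str, str]]:
    """Concatenate several instruction sets into one list of pairs."""
    return [to_prompt_response(r) for ds in datasets for r in ds]


if __name__ == "__main__":
    # Tiny made-up records, one per assumed schema.
    alpaca = [{"instruction": "Add.", "input": "1+1", "output": "2"}]
    dolly = [{"instruction": "Say hi.", "context": "", "response": "hi"}]
    for pair in combine(alpaca, dolly):
        print(pair)
```

Normalizing every source to the same two keys keeps the mixing step trivial: combining instruction sets becomes list concatenation, and the prompt template can be changed in one place.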