
Full finetuning

Open • rasbt opened this issue 2 years ago • 1 comment

This is an attempt to implement full finetuning (as opposed to parameter-efficient finetuning via adapters, etc.).

A script for full finetuning (updating all layers).

Todos

  • [x] Create finetune/full.py script
  • [x] Add how-to guide
  • [x] Create generate/full.py script

rasbt avatar Jun 06 '23 23:06 rasbt

Well, maybe ignore the last commit. Lazy loading now works on a single device, but it fails when using multiple devices with DeepSpeed.

The previous commit, without lazy loading, works with DeepSpeed on multiple devices. (Finetuning needs DeepSpeed stage 3 and at least 6 GPUs, but it works!)

Maybe that issue with DeepSpeed and lazy loading can be circumvented once #118 is finished.
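
For a rough sense of why full finetuning ends up needing DeepSpeed stage 3 and several GPUs, here is a back-of-the-envelope sketch. It assumes mixed-precision training with Adam, which costs roughly 16 bytes of model state per parameter (fp16 weights, fp16 gradients, and fp32 master weights, momentum, and variance), and it assumes stage 3 shards those states evenly across GPUs. The 7B parameter count is a hypothetical example, not a number from this PR, and activations and buffers are not counted, so real usage will be higher.

```python
# Rough model-state memory per GPU for full finetuning under ZeRO stage 3.
# Assumption: mixed-precision Adam, ~16 bytes of model state per parameter.
BYTES_FP16_WEIGHTS = 2
BYTES_FP16_GRADS = 2
BYTES_FP32_ADAM_STATES = 12  # fp32 master weights + momentum + variance
BYTES_PER_PARAM = BYTES_FP16_WEIGHTS + BYTES_FP16_GRADS + BYTES_FP32_ADAM_STATES


def model_state_gb_per_gpu(num_params: float, num_gpus: int) -> float:
    """Return the estimated model-state memory per GPU in GB (1 GB = 1e9 bytes).

    Assumes the states are sharded evenly across all GPUs, as in stage 3.
    Activations, buffers, and fragmentation are ignored.
    """
    if num_gpus < 1:
        raise ValueError("num_gpus must be at least 1")
    total_bytes = num_params * BYTES_PER_PARAM
    return total_bytes / num_gpus / 1e9


if __name__ == "__main__":
    hypothetical_num_params = 7e9  # assumed model size, for illustration only
    for gpus in (1, 6):
        gb = model_state_gb_per_gpu(hypothetical_num_params, gpus)
        print(f"{gpus} GPU(s): ~{gb:.1f} GB of model states per GPU")
```

Under these assumptions, a single device would have to hold all of the model states by itself, whereas sharding across 6 GPUs divides that figure by six, before activations are added on top.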

rasbt avatar Jun 13 '23 23:06 rasbt