peft icon indicating copy to clipboard operation
peft copied to clipboard

Fine-tuning a 13B mt0-xxl model

Open QLutz opened this issue 1 year ago • 0 comments

Hello and thanks for the awesome library !

I'd like to reproduce some of the results you display in the repo's README and had a few questions:

  1. I was wondering what you meant by

Also, we didn't use the larger 13B mt0-xxl model

as written in the Use cases section of the README ?

  1. What code did you use for the PEFT-LoRA PyTorch x mt0-xxl datapoint? I'm assuming it's the one from examples/conditional_generation/peft_lora_seq2seq.ipynb but I run into OOM errors when I try on a similar setup (i.e. A100 80GB + fair amount of RAM).

Thanks in advance :)

QLutz avatar Mar 29 '23 11:03 QLutz