curated-transformers icon indicating copy to clipboard operation
curated-transformers copied to clipboard

Optimal Qlora settings

Open KnutJaegersberg opened this issue 2 years ago • 1 comments

In HF transformers, the default setting of qlora does not replicate the qlora of the original paper, leaving valuable performance lying on the ML practitioners street using lib defaults.
One has to apply lora to certain parts of the NN, please see Tweet by Tim Dettmers:

https://twitter.com/Tim_Dettmers/status/1695377756232589459

I guess this has to be customized for each model architecture, sounds like a feature for curated-transformers, to me.

KnutJaegersberg avatar Sep 02 '23 09:09 KnutJaegersberg

Thanks for the suggestion! We hope to look more into training in the coming period and will definitely take this into account.

danieldk avatar Sep 05 '23 20:09 danieldk