unsloth Add support for LISA ?

Add support for LISA ?

Open risedangel opened this issue 10 months ago • 4 comments

Hello any plans to support with LISA ?

Arxiv: https://arxiv.org/pdf/2403.17919.pdf

how it compares in the terms of VRAM usage ? standard 16 bit model fine tuning ?

Mar 28 '24 09:03 risedangel

https://github.com/OptimalScale/LMFlow this tool seems to suppot it.

Mar 28 '24 09:03 risedangel

Oh yep had a discussion with some researchers about this! Speed wise, because the first and the last get updated, the gradients have to be backpropagated to the start, so not that much faster than LoRA.

A big issue is the benchmarks which I'll have to manually check - I'm slightly skeptical of LoRA being "worse" than FT / LISA or LISA being better than full finetuning - very counterintuitive and confusing actually.

Mar 28 '24 17:03 danielhanchen

@danielhanchen hi, any news?

Apr 01 '24 18:04 risedangel

@risedangel No sorry :( Been stuck on fixing bugs

Apr 03 '24 12:04 danielhanchen

unsloth unsloth copied to clipboard

Add support for LISA ?

unsloth
unsloth copied to clipboard