llama.cpp
feature: linearly interpolating one or multiple LoRA with base model
Once https://github.com/ggerganov/llama.cpp/pull/820 is merged, it would be nice to allow linearly interpolating one or multiple LoRA adapters with the base model.
LoRA should be able to be loaded interactively, and interpolation weights also adjusted interactively.
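The math behind the request is simple: each LoRA adapter contributes a low-rank delta `B @ A` to a base weight matrix, and linear interpolation just scales each delta by a per-adapter weight before adding it. A minimal NumPy sketch (the function name and signature here are illustrative, not llama.cpp API):

```python
import numpy as np

def apply_loras(base_weight, loras, alphas):
    """Linearly combine one or more LoRA deltas with a base weight.

    base_weight: (d_out, d_in) base model weight matrix
    loras: list of (B, A) pairs, B: (d_out, r), A: (r, d_in)
    alphas: per-adapter interpolation weights
    """
    w = base_weight.copy()
    for (B, A), alpha in zip(loras, alphas):
        w += alpha * (B @ A)  # low-rank update, scaled by its weight
    return w
```

With `alpha = 1.0` this is a plain LoRA merge; fractional alphas interpolate between the base model and the fully adapted one, and multiple adapters simply sum.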
@jon-chuang what currently seems to be the blocker to making LoRA adapters swappable?
https://github.com/ggerganov/llama.cpp/blob/master/llama.cpp#L2838-L2876
Proposal: just prototype interactive LoRA and show that it might be valuable. Shouldn't be too hard!
Think of it as sharing a Python notebook that uses torch, but with ggml instead of torch.
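One way such a prototype could make adapters swappable (a sketch only; the class and method names are hypothetical, not llama.cpp's implementation) is to keep a pristine copy of the base weights and recompute the merged weights whenever an adapter is added, removed, or re-weighted:

```python
import numpy as np

class LoraMixer:
    """Hypothetical helper: keeps base weights untouched so LoRA adapters
    can be added, removed, or re-weighted interactively."""

    def __init__(self, base_weight):
        self.base = np.array(base_weight, copy=True)  # never mutated in place
        self.loras = {}  # name -> (B, A, alpha)

    def set_lora(self, name, B, A, alpha):
        self.loras[name] = (B, A, alpha)

    def set_alpha(self, name, alpha):
        B, A, _ = self.loras[name]
        self.loras[name] = (B, A, alpha)

    def remove_lora(self, name):
        self.loras.pop(name, None)

    def merged_weight(self):
        # Recompute from the pristine base, so swapping is always exact
        # and never accumulates drift from repeated in-place merges.
        w = self.base.copy()
        for B, A, alpha in self.loras.values():
            w += alpha * (B @ A)
        return w
```

The key design point is that merging in place destroys the information needed to un-apply an adapter; keeping either the base weights or the deltas around is what makes interactive swapping cheap and exact.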
This issue was closed because it has been inactive for 14 days since being marked as stale.