vllm icon indicating copy to clipboard operation
vllm copied to clipboard

Add support for ReFT

Open RonanKMcGovern opened this issue 1 year ago • 1 comments

🚀 The feature, motivation and pitch

Motivation is to allow ReFT representations to be applied on the fly during inference, which can be done in a batchwise manner.

this is much faster than applying LoRAs

Alternatives

LoRA is too slow as it requires adapter weights to be added, which increases the number of operations.

Additional context

See https://github.com/stanfordnlp/pyreft/issues/63

RonanKMcGovern avatar Apr 27 '24 11:04 RonanKMcGovern

as a user of pyreft I want to highlight the need for selecting subspaces into a hypothetical PyreftRequest (see https://github.com/stanfordnlp/pyreft/issues/63#issuecomment-2073233538)

chris-aeviator avatar May 15 '24 10:05 chris-aeviator

Any traction on this?

jvlinsta avatar Jul 23 '24 09:07 jvlinsta

This issue has been automatically marked as stale because it has not had any activity within 90 days. It will be automatically closed if no further activity occurs within 30 days. Leave a comment if you feel this issue should remain open. Thank you!

github-actions[bot] avatar Oct 28 '24 02:10 github-actions[bot]