pyreft
[P1] TGI and vLLM support
---
Are there plans for inference support (e.g., TGI and vLLM)? This is needed if it's to be used by devs in production.
---
Is fine-tuning much faster than with LoRA?
- Optimization and the backward pass are MUCH faster, but surely the forward pass is similar (technically, slightly slower).
- Why so many epochs?
  - I was surprised to see 10-12 epochs in the paper.
  - In practice with LoRA I find less is more (often just one epoch with a constant LR), because stopping early avoids overfitting.
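For context on the forward-pass point: a LoReFT-style intervention only adds a few low-rank matmuls at the intervened positions, so the forward cost stays close to the base model's. A rough numpy sketch of the edit from the paper, `h + R^T(W h + b - R h)` (names and shapes here are illustrative, not pyreft's actual API):

```python
import numpy as np

def loreft_intervention(h, R, W, b):
    """LoReFT-style edit of one hidden state: h + R^T (W h + b - R h).

    R (r x d) projects into a rank-r subspace; W (r x d) and b (r,)
    are the learned projection. All names are illustrative.
    """
    return h + R.T @ (W @ h + b - R @ h)

rng = np.random.default_rng(0)
d, r = 768, 4                      # hidden size, intervention rank
h = rng.standard_normal(d)         # hidden state at an intervened position
R = rng.standard_normal((r, d))
W = rng.standard_normal((r, d))
b = rng.standard_normal(r)

h_new = loreft_intervention(h, R, W, b)

# Forward overhead per intervened position is just a few r x d matmuls,
# while the backward pass only touches the tiny R, W, b parameters.
```

This is why optimization can be much faster even though each forward pass is marginally slower than the frozen base model.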