pyreft icon indicating copy to clipboard operation
pyreft copied to clipboard

[P1] TGI and vLLM support

Open RonanKMcGovern opened this issue 10 months ago • 7 comments

  1. Are there plans for inference support. This is needed if it's to be used by devs in production.

  2. Is fine tuning much faster than LoRA?

  • Optimization and backward pass are MUCH faster, but surely forward pass is similar (technically, slightly slower)
  1. Why so many epochs?
  • I was surprised to see 10-12 epochs in the paper.
  • in practice with LoRA I find less is more (often just do one epoch with constant LR) because it stops overfitting

RonanKMcGovern avatar Apr 22 '24 23:04 RonanKMcGovern