dify icon indicating copy to clipboard operation
dify copied to clipboard

support for ollama keep_alive parameter

Open bogan-FMA opened this issue 9 months ago • 1 comments

Self Checks

  • [X] I have searched for existing issues search for existing issues, including closed ones.
  • [X] I confirm that I am using English to submit this report (我已阅读并同意 Language Policy).
  • [X] Pleas do not modify this template :) and fill in all the required fields.

1. Is this request related to a challenge you're experiencing?

yes

2. Describe the feature you'd like to see

support the keep_alive feature of ollama, detailed in https://github.com/ollama/ollama/pull/2146

3. How will this feature improve your workflow or experience?

currently the model will unload after 5 min idle, the next reload will take more than 30min for a 70b model.

4. Additional context or comments

no

5. Can you help us with this feature?

  • [ ] I am interested in contributing to this feature.

bogan-FMA avatar Apr 30 '24 11:04 bogan-FMA

Hey @bogan-FMA ,

You can check #4655 this.

Yash-1511 avatar May 25 '24 12:05 Yash-1511