Ishaan Sehgal
Tuning and Inference combined in same image
OK, I just ran this locally and didn't have any issues. Let's confirm a couple of things, @rliberoff: 1. The correct image is being used. You should git checkout tag 0.3.0...
Hi @rliberoff, thanks for sharing. I recommend reducing the max length to speed up requests for the medium model. Additionally, we're adjusting the deployment specs to utilize all available GPUs,...
Apologies for the delay. You can edit the deployment specification by running the following command: ``` kubectl edit deployment ``` In the configuration, update the resource limits and requests from:...
Being addressed with https://github.com/Azure/kaito/issues/606
prometheus client or other requirements -> requirements.txt
From a high level, why is LWS needed for RAG?
@hungry1526 Is this error reproducible, and do you have the kaito-pod logs? I couldn’t reproduce it following your steps. A couple of things to check: - Are you adding the...
@dependabot recreate
Let dependabot open a PR for the updated version.