Ishaan Sehgal

10 comments by Ishaan Sehgal

Tuning and Inference combined in same image

OK, I just ran this locally and didn't have any issues. Let's confirm a couple of things, @rliberoff: 1. The correct image is being used. You should git checkout tag 0.3.0...

Hi @rliberoff, thanks for sharing. I recommend reducing the max length to speed up requests for the medium model. Additionally, we're adjusting the deployment specs to utilize all available GPUs,...

Apologies for the delay. You can edit the deployment specification by running the following command:
```
kubectl edit deployment
```
In the configuration, update the resource limits and requests from:...

Being addressed with https://github.com/Azure/kaito/issues/606

prometheus client or other requirements -> requirements.txt

From a high level, why is LWS needed for RAG?

@hungry1526 Is this error reproducible, and do you have the kaito-pod logs? I couldn’t reproduce it following your steps. A couple of things to check: - Are you adding the...

Let dependabot open a PR for the updated version.