Haadi Rakhangi

12 comments by Haadi Rakhangi

Sure! Let me know if there is anything I can assist with.

Thank you for the feedback, @ishaan-jaff. I'll go ahead and add the necessary tests to the PR. Regarding the inclusion of `qdrant_client` as a dependency, we can interact with Qdrant...

Thank you, @ishaan-jaff. I'll proceed with using the REST API and the httpx handler from litellm.

Hi @ishaan-jaff, I've implemented Qdrant Semantic Caching using their REST API and added the 'PUT' method to `http_handler.py`, as it was required for certain actions with Qdrant. You can refer...
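For context on why a 'PUT' method was needed: in Qdrant's REST API, creating a collection and upserting points are both PUT requests. The snippet below is only a minimal sketch of that request shape, not the code from the PR. It uses the standard library's `urllib` instead of litellm's httpx handler so that it stays self-contained, and it only builds the requests without sending them. The URL, the collection name `semantic_cache_demo`, the vector size, and the payload fields are all hypothetical.

```python
import json
import urllib.request

QDRANT_URL = "http://localhost:6333"  # assumption: a local Qdrant instance
COLLECTION = "semantic_cache_demo"    # hypothetical collection name


def build_put(path: str, body: dict) -> urllib.request.Request:
    """Build (but do not send) a JSON PUT request for the Qdrant REST API."""
    return urllib.request.Request(
        url=f"{QDRANT_URL}{path}",
        data=json.dumps(body).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="PUT",
    )


# Creating a collection is a PUT on /collections/{name}.
# The tiny vector size is for illustration only.
create_collection = build_put(
    f"/collections/{COLLECTION}",
    {"vectors": {"size": 4, "distance": "Cosine"}},
)

# Upserting a cached entry (embedding plus payload) is a PUT on
# /collections/{name}/points. The payload field name is hypothetical.
upsert_point = build_put(
    f"/collections/{COLLECTION}/points",
    {
        "points": [
            {
                "id": 1,
                "vector": [0.1, 0.2, 0.3, 0.4],
                "payload": {"response": "cached answer"},
            }
        ]
    },
)

print(create_collection.get_method(), create_collection.full_url)
print(upsert_point.get_method(), upsert_point.full_url)
```

Passing either request to `urllib.request.urlopen` would send it to a running Qdrant instance. In the PR itself, the equivalent calls would go through litellm's own HTTP handler.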

@ishaan-jaff I've added the two requested tests to the `test_caching.py` file. Please go through them and let me know if any changes are needed!

> can you share a screenshot of both tests working for you locally?

Sure! Here are the screenshots you asked for:

Testing with acompletion: ![image](https://github.com/user-attachments/assets/d55fac05-0efe-4e7f-8d45-6a5dbec968ed)

Testing with acompletion +...

Thanks @ishaan-jaff, I'm glad to contribute! 😄

@krrishdholakia could you please assign this issue to me if you believe it would be a valuable contribution to the LiteLLM library?

I have attached a cookbook for my contribution, semantic caching with the Qdrant vector database. You can refer to it for the results: https://colab.research.google.com/drive/1Lew6xi0ACfIigzfnD_ggWsVIXn-PvaBN?usp=sharing

![image](https://github.com/user-attachments/assets/b026c7cd-d3d2-46a8-8e0c-50fc67e4fcd1) ![image](https://github.com/user-attachments/assets/951fd22f-2d78-491f-af12-5a06805c4b63)

I have created a...

Hi @sumitdas66, thank you for your patience. I have tested and confirmed that the existing code supports the necessary settings without any additional changes. Here is an example of the `config.yaml`...