Varun Gupta

Results 87 comments of Varun Gupta

Same error. I am unable to follow run this [script](https://gist.github.com/waleedkadous/4c41f3ee66040f57d34c6a40e42b5969). Unable to find the package installation for LocalHuggingFaceEmbeddings. ``` python3 build_vector_store_fast.py Traceback (most recent call last): File "/opt/tiger/build_vector_store_fast.py", line 11,...

@gangmuk Can you please take a look for prefix-cache-preble - v0.3.0 release. @linjianshu for prefix-cache routing strategy, there were known issues with v0.2.0 release. Please use v0.3.0 release for prefix-cache,...

@linjianshu can you build image from main branch and try it out (after the fix #1147 )

What is the rootcause?

Presently, e2e tests are working fine. I will close this issue.

> Overall code change looks good to me. However, we need some discussion on the model API abstraction. We should come up enough future features and take those insights into...

> After that, any feature relies on statistics won't work out of box and they have to append the stream options right? If that case, Let's update the docs as...

https://github.com/vllm-project/aibrix/issues/790