Zoltan Fedor
Hah, I just linked from my old issue to your newly created one, as I thought maybe the one you created will get more attention than mine. Hah :-)
@etiennedi , I am getting confused — is filtering with BM25 already supported? I was just looking at my Haystack code and saw that just a few days ago...
Okay, then that Haystack PR cannot be correct. That is odd, as it has a unit test which I wrote back in July which catches the error thrown by Weaviate and...
Hi @etiennedi , As suspected, that Haystack PR was wrong: it incorrectly assumed that Weaviate now supports filters with BM25 (and also included a bug causing it in reality to run an...
The note from the KV-cache implementation on BART states: _"Note: current implementation of K-V cache does not exhibit performance gain over the non K-V cache TensorRT version. Please consider to...
We are very much looking forward to that! Hopefully that also applies to the OP's scenario: large inputs to T5 models.
> While waiting for this update, we started using NVIDIA's FasterTransformer library instead. It has a highly optimized T5 GPU runtime with KV cache supported and it's 5-10x faster than...
Also, Haystack could be used for model serving — as a replacement for OpenAI for those who want to serve their own LLM. Haystack pipelines integrate with Ray Serve to...
Yeah, I do use Haystack pipelines with nodes acting as clients for NVIDIA Triton for serving the LLMs locally / building LangChain tools for the agent — blazing fast.
@notkriswagner, Thanks. I have actually never tested by killing a PHP script. No, I have Apache with PHP 7.2 (mod_php7), and when HTTP calls which execute a Snowflake query get...