mrm1001 comments

Results 11 comments of


                                            mrm1001

find an industry dataset to showcase evaluation metrics

After catch up with @vblagoje : - pick a min of 2 datasets that look interesting - check their license to make sure we can host a processed version somewhere...

find an industry dataset to showcase evaluation metrics

Hi Vladimir, could we split both datasets into 2: - one has the deduplicated set of documents ready to be loaded into a RAG pipeline - the other one has...

find an industry dataset to showcase evaluation metrics

Thoughts about how to use these datasets for evaluation: ### Pubmed dataset - https://huggingface.co/datasets/vblagoje/PubMedQA_instruction - it has question (”instruction”), context (”paragraph with the right answer”) and response (”right answer”) -...

find an industry dataset to showcase evaluation metrics

Also found by @vblagoje : **AllenAI extractive QA dataset** - https://huggingface.co/datasets/allenai/ropes - **How can this dataset be used to show evaluation** - we can use the "situation" and "question" as...

Building a Vector Database from Specific Text

Hi @NILICK, have you tried using a layout-aware library that could help you do this, like using [unstructured.io](https://docs.unstructured.io/open-source/introduction/document-elements)?

Async Pipelines

Hi there, to anybody in this thread. We are going to host an async office hour next Tuesday in our discord server. Office hours are friendly informal sessions. We would...

feat: Trace pipeline run input/output data

Dropping comment to watch this PR.

QA problem in using QdrantDocumentStore

Does the retriever only return 1 document? If you're on the latest version (2.1.0) pass this to your pipeline: `query_pipeline.run(data, include_outputs_from={"retriever"})`, you can see what gets returned by the retriever.

QA problem in using QdrantDocumentStore

What I meant is, can you try to see what is returned by the retriever, without adding the filters?

Add the memory feature to the library to enable chat + RAG

There were some issues that were identified with the current proposed approach: https://github.com/deepset-ai/haystack-cookbook/pull/90#issuecomment-2161170220. The two key issues today are: * user messages are not tracked in memory * the use...