mrm1001

Results 11 comments of mrm1001

After catching up with @vblagoje:
- pick a minimum of 2 datasets that look interesting
- check their license to make sure we can host a processed version somewhere...

Hi Vladimir, could we split both datasets into 2:
- one has the deduplicated set of documents ready to be loaded into a RAG pipeline
- the other one has...

Thoughts about how to use these datasets for evaluation:

### PubMed dataset
- https://huggingface.co/datasets/vblagoje/PubMedQA_instruction
- it has question ("instruction"), context ("paragraph with the right answer") and response ("right answer")
-...

Also found by @vblagoje:

**AllenAI extractive QA dataset**
- https://huggingface.co/datasets/allenai/ropes
- **How can this dataset be used to show evaluation**
- we can use the "situation" and "question" as...

Hi @NILICK, have you tried using a layout-aware library that could help you do this, such as [unstructured.io](https://docs.unstructured.io/open-source/introduction/document-elements)?

Hi there, to anybody in this thread: we are going to host an async office hour next Tuesday in our Discord server. Office hours are friendly informal sessions. We would...

Dropping a comment to watch this PR.

Does the retriever only return 1 document? If you're on the latest version (2.1.0), pass `include_outputs_from={"retriever"}` when you run your pipeline, e.g. `query_pipeline.run(data, include_outputs_from={"retriever"})`, to see what the retriever returns.

What I meant is: can you check what the retriever returns without the filters applied?

Some issues were identified with the currently proposed approach: https://github.com/deepset-ai/haystack-cookbook/pull/90#issuecomment-2161170220. The two key issues today are:
* user messages are not tracked in memory
* the use...