langchain-benchmarks

Is it possible to expand the size of the semi-structured reports dataset?

Open applepieiris opened this issue 1 year ago • 0 comments

Many RAG-based chatbots are built on PDF-like documents, so handling PDF files well is necessary for improving RAG performance. Thank you for your effort in creating the semi-structured dataset for this scenario. However, it contains very few PDFs and QA pairs: just 6 docs and 30 QA pairs in total. In the LangSmith dataset dashboard, I see that most of the experiments score 100% on Faithfulness. Is it possible to expand this benchmark?

applepieiris, Sep 10 '24 06:09