ragas
ragas copied to clipboard
Evaluation framework for your Retrieval Augmented Generation (RAG) pipelines
**Describe the Feature** Hi @shahules786 @jjmachan, I'm back in Ragas business ^^ I've recently stumble upon a `Failed to parse output. Returning None.` while trying to evaluate the faithfulness of...
- [x] I checked the [documentation](https://docs.ragas.io/) and related resources and couldn't find an answer to my question. **Your Question** what is unclear to you? What would you like to know?...
## 📋 Proposal I'm proposing to refactor the metrics module to create a clearer separation between core abstractions and specific metric implementations. ## 🎯 Motivation Currently, the `src/ragas/metrics/` directory mixes...
# Feature: Optional Handit.ai integration to auto-fix prompts after Ragas evaluations Hi team 👋 Love what you’ve built with Ragas — it’s become a go-to tool for evaluating LLM apps....
[ ] I have checked the [documentation](https://docs.ragas.io/) and related resources and couldn't resolve my bug. **Describe the bug** When I used the evaluate() method to conduct multi-threaded evaluation of my...
[ ] I checked the [documentation](https://docs.ragas.io/) and related resources and couldn't find an answer to my question. **Your Question** The node types in the knowledge graph of ragas are relatively...
I've started using ColPali retrieval and would like to evaluate multimodal retrieval. Does Ragas support that?
[ ] I checked the [documentation](https://docs.ragas.io/) and related resources and couldn't find an answer to my question. My question is: Is it a possibility to generate Synthetic testset without knowledge...
It takes two hours to evaluate two indicators with 500 data records. Is there any method to accelerate the evaluation?
Here is my result:  and my code: ``` import typing as t import asyncio from typing import List from datasets import load_dataset, load_from_disk from ragas.metrics import faithfulness, context_recall, context_precision...