RAGAS metrics vs Traditional Metrics

Open • iosuAbal opened this issue 7 months ago • 0 comments

Hi!

I would like to know whether RAGAS metrics are as reliable as traditional metrics, especially retrieval metrics such as precision@k, recall@k, MRR, and nDCG. Are there any studies or papers on LLM-as-a-judge evaluation that support this?
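
For concreteness, the traditional retrieval metrics named above can be sketched as follows. This is only a minimal illustration assuming binary relevance labels per query; the function names and document ids are hypothetical and are not part of RAGAS or any other library.

```python
import math


def precision_at_k(ranked, relevant, k):
    """Fraction of the top-k retrieved ids that are relevant."""
    if k <= 0:
        raise ValueError("k must be positive")
    return sum(1 for doc in ranked[:k] if doc in relevant) / k


def recall_at_k(ranked, relevant, k):
    """Fraction of all relevant ids that appear in the top-k."""
    if not relevant:
        return 0.0
    return sum(1 for doc in ranked[:k] if doc in relevant) / len(relevant)


def reciprocal_rank(ranked, relevant):
    """1 / rank of the first relevant id, or 0.0 if none is retrieved."""
    for rank, doc in enumerate(ranked, start=1):
        if doc in relevant:
            return 1.0 / rank
    return 0.0


def mean_reciprocal_rank(queries):
    """MRR over an iterable of (ranked, relevant) pairs."""
    queries = list(queries)
    if not queries:
        return 0.0
    return sum(reciprocal_rank(r, rel) for r, rel in queries) / len(queries)


def ndcg_at_k(ranked, relevant, k):
    """nDCG@k with binary gains (1 if relevant, else 0)."""
    dcg = sum(
        1.0 / math.log2(rank + 1)
        for rank, doc in enumerate(ranked[:k], start=1)
        if doc in relevant
    )
    # Ideal ranking places every relevant id at the top.
    ideal_hits = min(len(relevant), k)
    idcg = sum(1.0 / math.log2(rank + 1) for rank in range(1, ideal_hits + 1))
    return dcg / idcg if idcg > 0 else 0.0


# Hypothetical example: one query, four retrieved ids, two labeled relevant.
ranked = ["d3", "d1", "d7", "d2"]
relevant = {"d1", "d2"}
print(precision_at_k(ranked, relevant, 2))
print(recall_at_k(ranked, relevant, 2))
print(mean_reciprocal_rank([(ranked, relevant)]))
print(ndcg_at_k(ranked, relevant, 4))
```

All four functions need ground-truth relevance labels for every query, which is part of why the comparison with LLM-judged scores matters in practice.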

If you could choose between LLM-as-a-judge and traditional metrics for evaluating a RAG pipeline, which would you choose?

Thanks

May 06 '25 12:05 iosuAbal