reliable-evaluation topic

xFinder

Stars: 181, Forks: 7, Watchers: 181

[ICLR 2025] xFinder: Large Language Models as Automated Evaluators for Reliable Evaluation