reliable-evaluation topic
List
reliable-evaluation repositories
xFinder
181
Stars
7
Forks
181
Watchers
[ICLR 2025] xFinder: Large Language Models as Automated Evaluators for Reliable Evaluation