firefox-translations-training icon indicating copy to clipboard operation
firefox-translations-training copied to clipboard

Investigate using LLMs for evaluation

Open eu9ene opened this issue 5 months ago • 1 comments

It would be interesting to compare evaluation capabilities of LLMs to COMET and human evaluation.

See the paper: Large Language Models Are State-of-the-Art Evaluators of Translation Quality

eu9ene avatar Aug 29 '24 23:08 eu9ene