firefox-translations-training Investigate using LLMs for evaluation

Investigate using LLMs for evaluation

Open eu9ene opened this issue 5 months ago • 1 comments

It would be interesting to compare evaluation capabilities of LLMs to COMET and human evaluation.

Aug 29 '24 23:08 eu9ene