Huseyin Atahan Inan
Results
2
comments of
Huseyin Atahan Inan
I am also confused about this. I might be missing something obvious but it seems to me that the reference answers are from here: https://github.com/lm-sys/FastChat/blob/main/fastchat/llm_judge/data/mt_bench/reference_answer/gpt-4.jsonl But for some questions, answers...
Hi @infwinston, many thanks for the clarification, I understand. Just an interesting note that when I downloaded pre-generated data via `python3 download_mt_bench_pregenerated.py` I see that actually gpt-4 generates more correct...