alignment-handbook icon indicating copy to clipboard operation
alignment-handbook copied to clipboard

Using MT-Bench to evaluate zephyr

Open abgoswam opened this issue 1 year ago • 2 comments

In the Readme.md here, it says :

  • Make sure the word zephyr exists in the --model-path argument when generating the model responses...

We should also ensure the word zephyr exists in the --model-id argument

This is because:

  1. in the MT-Bench code, they seem to be passing model_id around. code
  2. They look for word "zephyr" to find the matching adapter . here

This is probably a bug in FastChat.

Nonetheless we should update the README.md here too. Otherwise, people using the alignment-handbook will see low scores on MT-Bench

abgoswam avatar Feb 02 '24 08:02 abgoswam

related to : https://github.com/lm-sys/FastChat/issues/3026

abgoswam avatar Feb 08 '24 20:02 abgoswam

Yeah, also in my experience using the word 'zephyr' in model-id instead of model-path works. In fact, no need to use the word 'zephyr' in model-path at all.

fmguler avatar Feb 28 '24 11:02 fmguler