Songyang Zhang

Results 223 comments of Songyang Zhang

models is a list of dict. You can evalute multiple models with one config

Thanks for the feature request, we will add this feature into our backlog of Q4. PR are also welcomed! Thanks again.

> Have you ever tested the performance of APIs such as GPT on the human eval dataset, and how did you test it? Please check our documentation for more details.

You can use `--dump-eval-details` currently.https://github.com/open-compass/opencompass/blob/001e77fea236276aa8018b34cd23076145ab1672/run.py#L127 Feel free to re-open if needed.

@White-Friday Please check Flores. Feel free to re-open if needed.

Thanks. 1. The error message indicates that there exists internet connection issue ``` huggingface_hub.utils._errors.LocalEntryNotFoundError: Connection error, and we cannot find the requested files in the cached path. Please try again...

It appears that the inference is functioning correctly, so could you please provide a more detailed description of the bug you are encountering?

Thanks. Would you like to provide an example config, we can try the config to re-implement this issue.

Thanks for the reporting. Please try the prompt template ``` mmlu_gen_23a9a9 mmlu_gen_79e572 ``` We will investigate the influence of the mentioned problem.