opencompass issues

自定义的模型API都要配置步骤有哪些，除了写config文件外

4

### Describe the feature 自定义的模型API都要配置步骤有哪些，除了写config文件外 ### Will you implement it? - [ ] I would like to implement this feature and create a PR!

zhe123tc

[Feature] MMLU-PRO

1

### Describe the feature Huggingface : https://huggingface.co/datasets/TIGER-Lab/MMLU-Pro?row=0 ### Will you implement it? - [ ] I would like to implement this feature and create a PR!

Ezra-Yu

[Bug] run with transformers==4.40.2, error "HuggingFacewithChatTemplate does not support ppl-based evaluation".

3

### Prerequisite - [X] I have searched [Issues](https://github.com/open-compass/opencompass/issues/) and [Discussions](https://github.com/open-compass/opencompass/discussions) but cannot get the expected help. - [X] The bug has not been fixed in the [latest version](https://github.com/open-compass/opencompass). ### Type...

zhulinJulia24

the prompt for triviaqa dataset

### Describe the feature triviaqa数据集的每条的question本身就有"?"，triviaqa_gen_2121ce.py的prompt中在最后又加了一个"?"。请问该问号是不是可以去掉，还是加上这个问号性能会更好更稳定吗？ ### Will you implement it? - [X] I would like to implement this feature and create a PR!

mengrusun

CMB + Qwen1.5-72B-Chat got empty answers

1

### Prerequisite - [X] I have searched [Issues](https://github.com/open-compass/opencompass/issues/) and [Discussions](https://github.com/open-compass/opencompass/discussions) but cannot get the expected help. - [X] The bug has not been fixed in the [latest version](https://github.com/open-compass/opencompass). ### Type...

qy1026

[Fix] Fix turbomind

Leymore

可以创建一个微信群吗？

1

### 描述该功能题目 ### 是否希望自己实现该功能？ - [ ] 我希望自己来实现这一功能，并向 OpenCompass 贡献代码！

bank010

[Bug] math_gen数据集评估随机失败

4

### 先决条件 - [X] 我已经搜索过 [问题](https://github.com/open-compass/opencompass/issues/) 和 [讨论](https://github.com/open-compass/opencompass/discussions) 但未得到预期的帮助。 - [X] 错误在 [最新版本](https://github.com/open-compass/opencompass) 中尚未被修复。 ### 问题类型我正在使用官方支持的任务/模型/数据集进行评估。 ### 环境 {'CUDA available': True, 'CUDA_HOME': '/usr/local/cuda', 'GCC': 'gcc (Ubuntu 11.3.0-1ubuntu1~22.04.1) 11.3.0', 'GPU...

berton820

[Feature] AGIEval数据集是否可以支持few-shot评测设置？

### 描述该功能 opencompass/opencompass/datasets/agieval/agieval.py 文件显示，目前AGIEval评测仅支持zero-shot设置： ``` def load(path: str, name: str, setting_name: str): from .dataset_loader import load_dataset, load_dataset_as_result_schema assert setting_name in 'zero-shot', 'only support zero-shot setting' dataset_wo_label = load_dataset(name, setting_name, path)...

cabbagecabbage

[Feature] Typos in the official document

1

### Describe the feature Hi, I found some typos in the efficient evaluation of the official document. In the following code snippet: https://github.com/open-compass/opencompass/blob/6c711cb262344b8819894a61f2791d5674e5cf73/docs/en/user_guides/evaluation.md?plain=1#L88-L100 line 97 `task=dict(type=OpenICLEvalTask)` should be `task=dict(type=OpenICLInferTask)`. And...

Galaxy-Husky

opencompass
opencompass copied to clipboard

Metadata

自定义的模型API都要配置步骤有哪些，除了写config文件外

[Feature] MMLU-PRO

[Bug] run with transformers==4.40.2, error "HuggingFacewithChatTemplate does not support ppl-based evaluation".

the prompt for triviaqa dataset

CMB + Qwen1.5-72B-Chat got empty answers

[Fix] Fix turbomind

可以创建一个微信群吗？

[Bug] math_gen数据集评估随机失败

[Feature] AGIEval数据集是否可以支持few-shot评测设置？

[Feature] Typos in the official document

← Metadata

Owner

Metadata

opencompass opencompass copied to clipboard

Metadata

← Metadata

Owner

Metadata

opencompass
opencompass copied to clipboard