opencompass issues

feat:enhanced mbpp process answer patterns

## Motivation 当我在使用mbpp测试Qwen3模型时，原有代码无法解决【BEGIN】```python这种情况下的process情况，故添加该处理patterns ## Modification 调整mbpp patterns ## BC-breaking (Optional) 不会 ## Use cases (Optional) 无 ## Checklist **Before PR**: - [ ] Pre-commit or other linting tools are used...

ShikangPang

[Feature] 使用OpenAISDK请求时并行请求

5

### Describe the feature 假设我有4个显卡，分别使用vllm部署了4个7b的不同的模型，我期望它可以并行请求这些模型。而不是请求完第一个的全部评测，才去请求第二个。 ### Will you implement it? - [ ] I would like to implement this feature and create a PR!

aicodex

[Bug] agieval_math is not multi-choice question.

### 先决条件 - [x] 我已经搜索过 [问题](https://github.com/open-compass/opencompass/issues/) 和 [讨论](https://github.com/open-compass/opencompass/discussions) 但未得到预期的帮助。 - [x] 错误在 [最新版本](https://github.com/open-compass/opencompass) 中尚未被修复。 ### 问题类型我正在使用官方支持的任务/模型/数据集进行评估。 ### 环境 python ### 重现问题 - 代码/配置示例 python ### 重现问题 - 命令或脚本 agieval_gen_617738.py...

bazinga014

[Bug] config file gsm8k_contamination_ppl_ecdd22 has an importing bug

### Prerequisite - [x] I have searched [Issues](https://github.com/open-compass/opencompass/issues/) and [Discussions](https://github.com/open-compass/opencompass/discussions) but cannot get the expected help. - [x] The bug has not been fixed in the [latest version](https://github.com/open-compass/opencompass). ### Type...

linboyang

[fix] Update README.md with a typo

[fix] Update README.md with a typo original: `to the field of factuality` Fixed: `to the field of factuality`

ktwu01

[Feature] Support Resume Mechanism for Interrupted Inference Tasks

1

### Describe the feature Problem Description Currently, when OpenCompass performs large-scale model inference (infer), if a task is interrupted unexpectedly (e.g., due to resource failures, manual termination, etc.), it requires...

ShikangPang

[Bug] 无法提交PR，因为pre-commit-config.yaml中的update-dataset-suffix和update-dataset-suffix-pacakge钩子不兼容windows环境

### Prerequisite - [x] I have searched [Issues](https://github.com/open-compass/opencompass/issues/) and [Discussions](https://github.com/open-compass/opencompass/discussions) but cannot get the expected help. - [x] The bug has not been fixed in the [latest version](https://github.com/open-compass/opencompass). ### Type...

Amatilas

[Bug] OpenICLEvalTask does not combine results split by NumWorkerPartitioner

### Prerequisite - [x] I have searched [Issues](https://github.com/open-compass/opencompass/issues/) and [Discussions](https://github.com/open-compass/opencompass/discussions) but cannot get the expected help. - [x] The bug has not been fixed in the [latest version](https://github.com/open-compass/opencompass). ### Type...

vinvcn

[Config] PHYBench config 问题

### 描述该功能 https://github.com/open-compass/opencompass/blob/5fd489994757e32bd64d8fbf3136bf71498c2a35/opencompass/configs/datasets/PHYBench/phybench_gen.py#L18 看起来 Remember 前面缺少个空格 ### 是否希望自己实现该功能？ - [ ] 我希望自己来实现这一功能，并向 OpenCompass 贡献代码！

BIGWangYuDong

[Feature] 请问主观评测脚本支持用本地模型作为judge模型吗？

### Describe the feature examples/eval_subjective.py 在这个文件中，我把judge_models改为了vllmwithchattemplate的形式，似乎并不能正常评测，alpaca eval的最终输出结果为空。请问主观评测脚本支持用本地模型作为judge模型吗？ ### Will you implement it? - [ ] I would like to implement this feature and create a PR!

KAKSIS

opencompass
opencompass copied to clipboard

Metadata

feat:enhanced mbpp process answer patterns

[Feature] 使用OpenAISDK请求时并行请求

[Bug] agieval_math is not multi-choice question.

[Bug] config file gsm8k_contamination_ppl_ecdd22 has an importing bug

[fix] Update README.md with a typo

[Feature] Support Resume Mechanism for Interrupted Inference Tasks

[Bug] 无法提交PR，因为pre-commit-config.yaml中的update-dataset-suffix和update-dataset-suffix-pacakge钩子不兼容windows环境

[Bug] OpenICLEvalTask does not combine results split by NumWorkerPartitioner

[Config] PHYBench config 问题

[Feature] 请问主观评测脚本支持用本地模型作为judge模型吗？

← Metadata

Owner

Metadata

opencompass opencompass copied to clipboard

Metadata

← Metadata

Owner

Metadata

opencompass
opencompass copied to clipboard