Results: 6 repositories owned by OpenCompass

opencompass

6.3k
Stars
689
Forks
6.3k
Watchers

OpenCompass is an LLM evaluation platform, supporting a wide range of models (Llama3, Mistral, InternLM2, GPT-4, LLaMa2, Qwen, GLM, Claude, etc.) across 100+ datasets.

LawBench

235
Stars
36
Forks

Benchmarking Legal Knowledge of Large Language Models

VLMEvalKit

1.1k
Stars
157
Forks

Open-source evaluation toolkit for large vision-language models (LVLMs), supporting ~100 VLMs and 40+ benchmarks

MixtralKit

759
Stars
81
Forks

A toolkit for inference and evaluation of 'mixtral-8x7b-32kseqlen' from Mistral AI

Ada-LEval

49
Stars
2
Forks

The official implementation of "Ada-LEval: Evaluating long-context LLMs with length-adaptable benchmarks"

ANAH

22
Stars
1
Forks

[ACL 2024] ANAH & [NeurIPS 2024] ANAH-v2