OpenCompass
Results
6
repositories owned by
OpenCompass
opencompass
6.3k
Stars
689
Forks
6.3k
Watchers
OpenCompass is an LLM evaluation platform, supporting a wide range of models (Llama3, Mistral, InternLM2,GPT-4,LLaMa2, Qwen,GLM, Claude, etc) over 100+ datasets.
LawBench
235
Stars
36
Forks
Watchers
Benchmarking Legal Knowledge of Large Language Models
VLMEvalKit
1.1k
Stars
157
Forks
Watchers
Open-source evaluation toolkit of large vision-language models (LVLMs), support ~100 VLMs, 40+ benchmarks
MixtralKit
759
Stars
81
Forks
Watchers
A toolkit for inference and evaluation of 'mixtral-8x7b-32kseqlen' from Mistral AI
Ada-LEval
49
Stars
2
Forks
Watchers
The official implementation of "Ada-LEval: Evaluating long-context LLMs with length-adaptable benchmarks"
ANAH
22
Stars
1
Forks
Watchers
[ACL 2024] ANAH & [NeurIPS 2024] ANAH-v2