lm-evaluation-harness icon indicating copy to clipboard operation
lm-evaluation-harness copied to clipboard

Add long context evaluation benchmarks such as LongBench and LEval.

Open txchen-USTC opened this issue 6 months ago • 2 comments

Add long context evaluation benchmarks such as LongBench and LEval.

txchen-USTC avatar Aug 05 '24 08:08 txchen-USTC