acl2024 topic

List acl2024 repositories

UHGEval

182
Stars
17
Forks
Watchers

[ACL 2024] User-friendly evaluation framework: Eval Suite & Benchmarks: UHGEval, HaluEval, HalluQA, etc.

Official implementation of the ACL 2024: Scientific Inspiration Machines Optimized for Novelty

LooGLE

139
Stars
6
Forks
Watchers

ACL 2024 | LooGLE: Long Context Evaluation for Long-Context Language Models

raid

31
Stars
11
Forks
Watchers

RAID is the largest and most challenging benchmark for machine-generated text detectors. (ACL 2024)

timechara

18
Stars
0
Forks
Watchers

🧙🏻Code and benchmark for our Findings of ACL 2024 paper - "TimeChara: Evaluating Point-in-Time Character Hallucination of Role-Playing Large Language Models"

Cotempqa

21
Stars
1
Forks
Watchers

Code and data for "Living in the Moment: Can Large Language Models Grasp Co-Temporal Reasoning?" (ACL 2024)

NewsBench

27
Stars
0
Forks
Watchers

[ACL 2024 Main] NewsBench: A Systematic Evaluation Framework for Assessing Editorial Capabilities of Large Language Models in Chinese Journalism

KIEval

32
Stars
2
Forks
Watchers

[ACL'24] A Knowledge-grounded Interactive Evaluation Framework for Large Language Models

camera

26
Stars
2
Forks
Watchers

Multimodal dataset for ad text generation in Japanese [Mita+, ACL2024]