acl2024 topic

List acl2024 repositories

UHGEval

182
Stars
17
Forks
Watchers

[ACL 2024] User-friendly evaluation framework: Eval Suite & Benchmarks: UHGEval, HaluEval, HalluQA, etc.

Official implementation of the ACL 2024: Scientific Inspiration Machines Optimized for Novelty

LooGLE

191
Stars
5
Forks
191
Watchers

ACL 2024 | LooGLE: Long Context Evaluation for Long-Context Language Models

raid

97
Stars
36
Forks
97
Watchers

RAID is the largest and most challenging benchmark for AI-generated text detection. (ACL 2024)

timechara

21
Stars
1
Forks
21
Watchers

🧙🏻 Code and benchmark for our Findings of ACL 2024 paper - "TimeChara: Evaluating Point-in-Time Character Hallucination of Role-Playing Large Language Models"

Cotempqa

32
Stars
1
Forks
32
Watchers

Code and data for "Living in the Moment: Can Large Language Models Grasp Co-Temporal Reasoning?" (ACL 2024)

NewsBench

33
Stars
1
Forks
33
Watchers

[ACL 2024 Main] NewsBench: A Systematic Evaluation Framework for Assessing Editorial Capabilities of Large Language Models in Chinese Journalism

KIEval

38
Stars
2
Forks
38
Watchers

[ACL'24] A Knowledge-grounded Interactive Evaluation Framework for Large Language Models

camera

26
Stars
2
Forks
26
Watchers

Multimodal dataset for ad text generation in Japanese [Mita+, ACL2024]