ACL 2024 topic
UHGEval
[ACL 2024] User-friendly hallucination evaluation framework: Eval Suite and benchmarks including UHGEval, HaluEval, and HalluQA
Scientific-Inspiration-Machines-Optimized-for-Novelty
Official implementation of the ACL 2024 paper "Scientific Inspiration Machines Optimized for Novelty"
langsuite
Official repository of LangSuitE
LooGLE
ACL 2024 | LooGLE: a benchmark for evaluating the long-context understanding of long-context language models
raid
RAID is the largest and most challenging benchmark for machine-generated text detectors. (ACL 2024)
timechara
🧙🏻 Code and benchmark for our Findings of ACL 2024 paper "TimeChara: Evaluating Point-in-Time Character Hallucination of Role-Playing Large Language Models"
Cotempqa
Code and data for "Living in the Moment: Can Large Language Models Grasp Co-Temporal Reasoning?" (ACL 2024)
NewsBench
[ACL 2024 Main] NewsBench: A Systematic Evaluation Framework for Assessing Editorial Capabilities of Large Language Models in Chinese Journalism
KIEval
[ACL 2024] A knowledge-grounded interactive evaluation framework for large language models
camera
Multimodal dataset for ad text generation in Japanese [Mita+, ACL 2024]