nlp-datasets topic

List nlp-datasets repositories

HistSumm

68
Stars
6
Forks
Watchers

Code and data for "Summarising Historical Text in Modern Languages" (EACL 2021)

nlp-library

1.1k
Stars
91
Forks
Watchers

curated collection of papers for the nlp practitioner 📖👩‍🔬

multi-task-NLP

363
Stars
54
Forks
Watchers

multi_task_NLP is a utility toolkit enabling NLP developers to easily train and infer a single model for multiple tasks.

kartaslov

352
Stars
49
Forks
Watchers

Открытые лингвистические датасеты: тональный словарь русского языка КартаСловСент, датасет по семантике, ассоциативный граф и датасет по орфографическим ошибкам и опечаткам.

ua-gec

255
Stars
21
Forks
Watchers

UA-GEC: Grammatical Error Correction and Fluency Corpus for the Ukrainian Language

TriggerNER

173
Stars
19
Forks
Watchers

TriggerNER: Learning with Entity Triggers as Explanations for Named Entity Recognition (ACL 2020)

CommonGen

137
Stars
23
Forks
Watchers

A Constrained Text Generation Challenge Towards Generative Commonsense Reasoning

VDCNN

171
Stars
41
Forks
Watchers

Implementation of Very Deep Convolutional Neural Network for Text Classification

nlp-public-dataset

341
Stars
74
Forks
Watchers

Chinese, English NER, English-Chinese machine translation dataset. 中英文实体识别数据集,中英文机器翻译数据集, 中文分词数据集