corpus-data topic

List corpus-data repositories

poetry

654
Stars
81
Forks

A curated corpus of modern Chinese poetry: 3,489 poets, 81.7K poems, 15.43M characters. Continuously expanding...

oie-resources

481
Stars
58
Forks

A curated list of Open Information Extraction (OIE) resources: papers, code, data, etc.

vue2-admin-lte

481
Stars
58
Forks

:bar_chart: A project converting AdminLTE to Vue.js v2.x

ua-gec

255
Stars
21
Forks

UA-GEC: Grammatical Error Correction and Fluency Corpus for the Ukrainian Language

CEC-Corpus

672
Stars
163
Forks

:books: Chinese Emergency Corpus, Semantic Intelligence Laboratory, Shanghai University

DANeS

65
Stars
14
Forks

DANeS is an open-source e-newspaper dataset created through a collaboration between DATASET JSC (dataset.vn) and AIV Group (aivgroup.vn)

OPIEC

36
Stars
6
Forks

Reading the data from OPIEC, an Open Information Extraction corpus

CEEC-Corpus

43
Stars
15
Forks

:books: Chinese Environment Emergency Corpus, Semantic Intelligence Laboratory, Shanghai University

canto-filter

31
Stars
2
Forks

Cantonese text filter for screening corpus data

Datasets

15
Stars
3
Forks

Datasets with text data for use in NLP, text analysis, information extraction, and ML research.