corpus-data topic
poetry
汉语现代诗歌语料库整理,3489诗人,81.7K诗歌,15.43M字。持续扩充...
oie-resources
A curated list of Open Information Extraction (OIE) resources: papers, code, data, etc.
vue2-admin-lte
:bar_chart: adminLTE to vuejs v2.x converting project
ua-gec
UA-GEC: Grammatical Error Correction and Fluency Corpus for the Ukrainian Language
CEC-Corpus
:books:中文突发事件语料库(Chinese Emergency Corpus)-上海大学-语义智能实验室
DANeS
DANeS is an open-source E-newspaper dataset by collaboration between DATASET JSC (dataset.vn) and AIV Group (aivgroup.vn)
OPIEC
Reading the data from OPIEC - an Open Information Extraction corpus
CEEC-Corpus
:books:中文环境突发事件语料库(Chinese Environment Emergency Corpus)-上海大学-语义智能实验室
canto-filter
粵文語料篩選器 Cantonese text filter
BioMedical-NLP-corpus
Biomedical NLP Corpus or Datasets.
Datasets
datasets with text data for use in NLP, Text analysis, information extraction, ML research.