nlp-datasets topic

List nlp-datasets repositories

chinese_medical_words

100
Stars
32
Forks
Watchers

手工整理医疗行业词汇、术语等语料。可用于语音识别、对话系统等各类nlp模型训练。

MetroTwitter

54
Stars
8
Forks
Watchers

What Twitter reveals about the differences between cities and the monoculture of the Bay Area

FreebaseQA

67
Stars
1
Forks
Watchers

The release of the FreebaseQA data set (NAACL 2019).

OPIEC

36
Stars
6
Forks
Watchers

Reading the data from OPIEC - an Open Information Extraction corpus

infotabs-code

18
Stars
7
Forks
Watchers

Implementation of the semi-structured inference model in our ACL 2020 paper, INFOTABS: Inference on Tables as Semi-structured Data.

zi-dataset

79
Stars
16
Forks
Watchers

汉字数据集,包括汉字的相关信息,例如笔画数、部首、拼音、英文释义/同义词等。

Datasets

15
Stars
3
Forks
Watchers

datasets with text data for use in NLP, Text analysis, information extraction, ML research.

benchie

38
Stars
8
Forks
Watchers

Comprehensive evaluation framework for Open Information Extraction.

XCSR

22
Stars
2
Forks
Watchers

Code Repo for the ACL21 paper "Common Sense Beyond English: Evaluating and Improving Multilingual LMs for Commonsense Reasoning"

bilkent-turkish-writings-dataset

39
Stars
2
Forks
Watchers

Turkish writings dataset that promotes creativity, content, composition, grammar, spelling and punctuation.