nlp-datasets topic

List nlp-datasets repositories

AMICorpusXML

52
Stars
29
Forks
Watchers

Extracts Transcript and Summary (Abstractive and Extractive) from the AMI Meeting Corpus

ChatGPT-test-dataset-01

33
Stars
11
Forks
Watchers

a small test dataset for use with OpenAI's ChatGPT

bothub

34
Stars
5
Forks
Watchers

Bothub is an open platform for predicting, training and sharing NLP datasets in multiple languages

4675-scifi

328
Stars
58
Forks
Watchers

chinese NLP corpus of chinese science fiction,chinese science fiction corpus : About 4675 Chinese science fiction novels 大约有4675本科幻小说,中文科幻小说自然语言处理语料库,中文科幻小说文本语料库,...

wula-scifi

83
Stars
19
Forks
Watchers

chinese NLP corpus of chinese science fiction, chinese science fiction corpus: Archive of the Ark Plan of Ula Science Fiction Website 乌拉科幻小说网方舟计划存档,中文科幻小说自然语言处理语料库,中文科...

ua-datasets

50
Stars
1
Forks
Watchers

A collection of datasets for Ukrainian language

WikiWhy

41
Stars
1
Forks
Watchers

WikiWhy is a new benchmark for evaluating LLMs' ability to explain between cause-effect relationships. It is a QA dataset containing 9000+ "why" question-answer-rationale triplets.

afrisent-semeval-2023

39
Stars
38
Forks
Watchers

AfriSenti-SemEval Shared Task 12: Sentiment Analysis for African languages : https://afrisenti-semeval.github.io/