Google Research Datasets

Results: 70 repositories owned by Google Research Datasets

MAVE

127 stars, 19 forks

The dataset contains 3 million attribute-value annotations across 1257 unique categories on 2.2 million cleaned Amazon product profiles. It is a large, multi-sourced, diverse dataset for product attri...

MultiReQA

29 stars, 3 forks

We are creating a challenging new benchmark MultiReQA: A Cross-Domain Evaluation for Retrieval Question Answering Models. Retrieval question answering (ReQA) is the task of retrieving a sentence-level...

natural-questions

862 stars, 153 forks

Natural Questions (NQ) contains real user questions issued to Google search, and answers found from Wikipedia by annotators. NQ is designed for the training and evaluation of automatic question answer...

NewSHead

35 stars, 3 forks

The NewSHead dataset is a multi-doc headline dataset used in NHNet for training a headline summarization model.

NewsQuizQA

31 stars, 6 forks

NewsQuizQA is a quiz-style question-answer dataset used for generating quiz questions about the news.

noun-verb

36 stars, 4 forks

This dataset contains naturally-occurring English sentences that feature non-trivial noun-verb ambiguity.

Nutrition5k

118 stars, 21 forks

Detailed visual + nutritional data for over 5,000 plates of food.

nyt-salience

22 stars, 11 forks

Automatically exported from code.google.com/p/nyt-salience

paws

525 stars, 52 forks

This dataset contains 108,463 human-labeled and 656k noisily labeled pairs that feature the importance of modeling structure, context, and word order information for the problem of paraphrase identifi...

QED

113 stars, 17 forks

QED: A Framework and Dataset for Explanations in Question Answering