Google Research Datasets

Results 70 repositories owned by Google Research Datasets

discofuse

31
Stars
8
Forks
Watchers

Disfl-QA

57
Stars
6
Forks
Watchers

A Benchmark Dataset for Understanding Disfluencies in Question Answering

eev

33
Stars
5
Forks
Watchers

The Evoked Expressions in Video dataset contains videos paired with the expected facial expressions over time exhibited by people reacting to the video content.

eth_py150_open

28
Stars
6
Forks
Watchers

A redistributable subset of the ETH Py150 corpus [https://www.sri.inf.ethz.ch/py150], introduced in the ICML 2020 paper 'Learning and Evaluating Contextual Embedding of Source Code' [https://proceedin...

gap-coreference

222
Stars
82
Forks
Watchers

GAP is a gender-balanced dataset containing 8,908 coreference-labeled pairs of (ambiguous pronoun, antecedent name), sampled from Wikipedia for the evaluation of coreference resolution in practical a...

great

22
Stars
12
Forks
Watchers

The dataset for the variable-misuse task, used in the ICLR 2020 paper 'Global Relational Models of Source Code' [https://openreview.net/forum?id=B1lnbRNtwr]

hiertext

225
Stars
18
Forks
Watchers

The HierText dataset contains ~12k images from the Open Images dataset v6 with large amount of text entities. We provide word, line and paragraph level annotations.

Image-Caption-Quality-Dataset

29
Stars
5
Forks
Watchers

A dataset of crowdsourced ratings for machine-generated image captions

KELM-corpus

207
Stars
10
Forks
Watchers