Google Research Datasets

70 repositories owned by Google Research Datasets

relation-extraction-corpus

55 Stars, 9 Forks

Automatically exported from code.google.com/p/relation-extraction-corpus

RxR

101 Stars, 12 Forks

Room-across-Room (RxR) is a large-scale, multilingual dataset for Vision-and-Language Navigation (VLN) in Matterport3D environments. It contains 126k navigation instructions in English, Hindi and Telu...

screen2words

31 Stars, 2 Forks

The dataset includes screen summaries that describe the functionality of Android app screenshots. It is used for training and evaluation of the screen2words models (our paper accepted by UIST'21 will be...

sentence-compression

120 Stars, 21 Forks

Large corpus of uncompressed and compressed sentences from news articles.

synthetic-fur

45 Stars, 4 Forks

A procedurally generated synthetic fur dataset with conditional inputs for machine learning and neural rendering.

Taskmaster

161 Stars, 50 Forks

Please see the readme file as well as our 2019 EMNLP paper linked here -->

TF-IDF-IIF-top100-wordlists

25 Stars, 3 Forks

Word lists for a variety of languages, each containing words that are distinctive to that language.

TimeDial

59 Stars, 5 Forks

Temporal Commonsense Reasoning in Dialog

ToTTo

377 Stars, 33 Forks

ToTTo is an open-domain English table-to-text dataset with over 120,000 training examples that proposes a controlled generation task: given a Wikipedia table and a set of highlighted table cells, prod...