Guanheng George Zhang
Hey @bentrevett, thanks for your tutorial. Since torchtext has updated its datasets with the new abstraction, I'm wondering if you plan to update the tutorial here. One of the users...
Add example docstrings to `torchtext.datasets`.
For the FB internal tests, the contents of `raw_datasets.json` are not valid. Update the format to pass the internal lint check.
The legacy TREC dataset has been retired to the `torchtext.legacy` folder. This version yields the raw text strings.
The legacy SST dataset has been retired to the `torchtext.legacy` folder. This version yields the raw text strings.
The following three datasets have been retired to the `legacy.datasets` folder. We are re-writing them to yield the raw text: - SNLI - MatchedMultiNLI ([link](https://www.kaggle.com/c/multinli-matched-open-evaluation)) - MismatchedMultiNLI ([link](https://www.kaggle.com/c/multinli-mismatched-open-evaluation)) Unfortunately, the...
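To illustrate the "raw text" abstraction these rewrites move toward, here is a minimal sketch: instead of pre-numericalized fields, the dataset is a plain iterator that yields `(label, raw_text)` pairs and leaves tokenization entirely to the user. The function and sample data below are hypothetical illustrations, not the actual torchtext implementation.

```python
# Hypothetical sketch of a raw-text dataset: an iterator over
# (label, raw_text) tuples, with no built-in tokenization.
def raw_text_dataset(rows):
    """Yield (label, raw_text) tuples from in-memory rows."""
    for label, text in rows:
        yield label, text

# Toy premise/hypothesis-style rows standing in for the real data files.
sample_rows = [
    ("entailment", "A man is playing a guitar."),
    ("contradiction", "Nobody is playing music."),
]

train_iter = raw_text_dataset(sample_rows)
first_label, first_text = next(train_iter)
```

The user then composes their own tokenizer and vocab on top of the raw strings, which is the key difference from the legacy field-based datasets.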
This is a prototype pretrained XLM-R model based on the RoBERTa encoder. There are a few features that we would like to highlight and collect feedback on: - The basic nn...
In the [XLM-R](https://arxiv.org/pdf/1911.02116.pdf) model, SentencePiece is used to tokenize the strings. We enable the SentencePiece processing pipeline here for the BERT workflow.
On top of https://github.com/pytorch/text/pull/1027. Adds a `__setitem__` method to `torchtext.experimental.vocab.Vocab`; a `__delitem__` method is added as well. [RuntimeError] if the token already exists, an error message is raised asking users...
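The intended semantics can be sketched with a minimal dict-backed vocab. The class name and exact error message below are assumptions based on the PR summary (assigning to an existing token raises `RuntimeError` rather than silently overwriting), not the actual torchtext code.

```python
# Hypothetical sketch of the proposed Vocab mutation semantics.
class MutableVocab:
    def __init__(self, tokens):
        self._stoi = {tok: i for i, tok in enumerate(tokens)}

    def __getitem__(self, token):
        return self._stoi[token]

    def __setitem__(self, token, index):
        # Per the PR summary: if the token already exists, raise a
        # RuntimeError instead of silently overwriting its index.
        if token in self._stoi:
            raise RuntimeError(f"token {token!r} already exists in the vocab")
        self._stoi[token] = index

    def __delitem__(self, token):
        del self._stoi[token]

v = MutableVocab(["hello", "world"])
v["pytorch"] = 2   # adding a new token succeeds
del v["world"]     # __delitem__ removes a token
```

Attempting `v["hello"] = 5` afterwards would raise `RuntimeError`, which forces callers to delete a token before re-assigning its index.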
Remove the unk tensor and allow users to add one if necessary.
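A minimal sketch of what this change implies for lookups, under assumed names: with no built-in `<unk>` entry, an out-of-vocabulary lookup fails unless the user registers a fallback index themselves (in the spirit of a `set_default_index`-style hook). This is an illustration of the behavior, not the torchtext implementation.

```python
# Hypothetical sketch: vocab without a built-in unk entry; the user
# opts in to a fallback index for out-of-vocabulary tokens.
class FallbackVocab:
    def __init__(self, tokens):
        self._stoi = {tok: i for i, tok in enumerate(tokens)}
        self._default = None

    def set_default_index(self, index):
        self._default = index

    def __getitem__(self, token):
        if token in self._stoi:
            return self._stoi[token]
        if self._default is None:
            raise RuntimeError(f"token {token!r} not found and no default index set")
        return self._default

v = FallbackVocab(["<unk>", "hello"])
v.set_default_index(v["<unk>"])   # user explicitly wires up the unk fallback
idx = v["missing"]                # falls back to the <unk> index
```

Without the `set_default_index` call, the same lookup raises, which makes the unk behavior an explicit user choice rather than a hidden default.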