Richard Shin issues

Results 13 issues of


                                            Richard Shin

How to cache internal computations?

The README says: > Content store caching is used by default for all external steps, and can be enabled for internal computations by providing suitable serialisation/deserialisation functions. Are there any...

question

`dec_vocab.json` produced by `arxiv-1906.11790v1.jsonnet` has a bunch of numbers in it

Even though all numbers should have been stripped out by disabling the generation of literals.

Arbitrary `prev_action_emb` when pointer maps have multiple entries

For choosing a table in Spider, we allow the model to point to the embedding for any of its columns or the embedding for the table itself: https://github.com/rshin/seq2struct/blob/e69c8eb182ec80770cde4cd369e1b698bc8e921a/seq2struct/models/spider_enc.py#L321-L328 However, when...

Remove dependency on CoreNLP

CoreNLP runs in the JVM in a separate process, which makes it annoying to use. spaCy should be a sufficient replacement for tokenization for looking up GloVe embeddings.

Upgrade to PyTorch 1.2

- [ ] Rewrite https://github.com/rshin/seq2struct/blob/master/seq2struct/models/lstm.py to use TorchScript instead - [ ] Test all major models

`primary_keys` of Spider preprocessing is wrong

Example: ``` In [6]: train_enc = json.loads(next(open('data/spider-20190205/nl2code-0401,output_from=false,emb=glove-42B,min_freq=50/enc/train.jsonl'))) In [7]: train_enc Out[7]: {'column_to_table': {'0': None, '1': 0, '10': 1, '11': 2, '12': 2, '13': 2, '2': 0, '3': 0, '4': 0,...

Richard Shin

How to cache internal computations?

`dec_vocab.json` produced by `arxiv-1906.11790v1.jsonnet` has a bunch of numbers in it

Arbitrary `prev_action_emb` when pointer maps have multiple entries

Remove dependency on CoreNLP

Upgrade to PyTorch 1.2

`primary_keys` of Spider preprocessing is wrong

Improve README for Spider/initial setup procedure

Implement NL2Code training for Hearthstone dataset

Add support for building with Bazel

Remove triple backslashes from cluster.py