Kyle Gorman
Nested tensors are a prototype feature in PyTorch; when certain conditions are met, they are used in the transformer implementation. I believe what actually happens is that it uses this...
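For context, a minimal sketch of the prototype nested-tensor API (not yoyodyne code; shapes and values are made up):

```python
import torch

# Variable-length sequences are stored without padding.
a = torch.randn(3, 8)  # sequence of length 3
b = torch.randn(5, 8)  # sequence of length 5
nt = torch.nested.nested_tensor([a, b])
print(nt.is_nested)  # True
# Convert back to an ordinary padded tensor of shape (2, 5, 8).
padded = torch.nested.to_padded_tensor(nt, padding=0.0)
```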
One simple possibility is to merge #225, which just silences the scary warning but doesn't otherwise change current behavior, and then investigate #224 at lower priority as an alternative.
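If we go that route, the silencing would presumably look something like the standard warnings-filter idiom below; the message regex is just a guess at the warning's wording, not necessarily what #225 actually does:

```python
import warnings

# Suppress the prototype-feature warning by matching its message.
warnings.filterwarnings(
    "ignore",
    message=".*nested tensors.*prototype.*",
    category=UserWarning,
)
```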
I don't understand this report: what's the behavioral consequence? Is this a design flaw or a bug? Is it an issue you want to assign to yourself? Is...
It seems to me the bug here is just "hard monotonic attention is broken". We already have a shared embedding space (which is a great feature) and we are doomed...
Putting the relevant traceback here for my debugging:

```
  File "/home/user/miniconda3/envs/py310/bin/yoyodyne-train", line 8, in <module>
    sys.exit(main())
  File "/home/user/miniconda3/envs/py310/lib/python3.10/site-packages/yoyodyne/train.py", line 423, in main
    model = get_model_from_argparse_args(args, datamodule)
  File "/home/user/miniconda3/envs/py310/lib/python3.10/site-packages/yoyodyne/train.py", line 247, in...
```
> Is there any docs on this? [`nn.Module`](https://pytorch.org/docs/stable/generated/torch.nn.Module.html) and ["modules as building blocks"](https://pytorch.org/docs/stable/notes/modules.html#modules-as-building-blocks). It's easy to tell when the module tracking is broken because there will be no gradients flowing...
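To make that concrete, here is a minimal sketch (hypothetical modules, not from this codebase) where tracking is broken by a plain Python list and fixed by `nn.ModuleList`:

```python
from torch import nn

class Broken(nn.Module):
    def __init__(self):
        super().__init__()
        # A plain list hides submodules from nn.Module's tracking, so
        # their parameters never reach the optimizer and get no gradients.
        self.layers = [nn.Linear(4, 4) for _ in range(2)]

class Fixed(nn.Module):
    def __init__(self):
        super().__init__()
        # nn.ModuleList registers each submodule properly.
        self.layers = nn.ModuleList(nn.Linear(4, 4) for _ in range(2))

print(len(list(Broken().parameters())))  # 0: tracking is broken
print(len(list(Fixed().parameters())))   # 4: weights and biases registered
```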
Is this still open?
@Adamits I think this would pose exactly the same issue as the multi-GPU case.
Yeah, I also find the transducer is better on CPU. (The same was true of the original DyNet code.)
Is this made redundant by #247? @bonham79 @Adamits