yoyodyne
Small-vocabulary sequence-to-sequence generation with optional feature conditioning
Currently we create a vocabulary of all items in all datapaths specified to the training script. However, we may want to study how models perform when presented with unknown symbols. In...
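If we go this route, one option is to reserve an index for unknowns at construction time and fall back to it at lookup time. A minimal sketch follows; the class and method names (`SymbolMap`, `index_of`) are hypothetical, not yoyodyne's actual API.

```python
# Hypothetical UNK-aware symbol lookup; not yoyodyne's actual vocabulary class.
UNK = "<UNK>"


class SymbolMap:
    """Maps symbols to integer indices, reserving an index for unknowns."""

    def __init__(self, symbols):
        # Index 0 is reserved for the unknown symbol.
        self._index = {UNK: 0}
        for symbol in symbols:
            self._index.setdefault(symbol, len(self._index))

    def index_of(self, symbol: str) -> int:
        # Symbols unseen at training time fall back to the UNK index.
        return self._index.get(symbol, self._index[UNK])


symbol_map = SymbolMap(["a", "b", "c"])
assert symbol_map.index_of("z") == symbol_map.index_of(UNK)
```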
What are people's thoughts on adding preprocessing scripts to allow BPE-like tokenization of characters? Technically we already support this (just tokenize your input yourself and use a delineation function). But I wonder if...
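As a sketch of the "tokenize your input yourself" path, the HuggingFace `tokenizers` library (an assumption here; it is not a yoyodyne dependency) can learn BPE merges over characters and re-emit each word as space-delimited subwords, which can then be fed to the existing whitespace-splitting pipeline:

```python
# Hedged sketch: external BPE preprocessing feeding whitespace-delimited input.
from tokenizers import Tokenizer
from tokenizers.models import BPE
from tokenizers.trainers import BpeTrainer

tokenizer = Tokenizer(BPE(unk_token="[UNK]"))
trainer = BpeTrainer(vocab_size=200, special_tokens=["[UNK]"])
words = ["running", "runner", "jumping", "jumper"]
# With no pre-tokenizer, each whole string is treated as one word and
# merges are learned over its characters.
tokenizer.train_from_iterator(words, trainer)

for word in words:
    # Emits e.g. "runn ing" (the merges depend on the training data).
    print(" ".join(tokenizer.encode(word).tokens))
```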
Might as well set up an autoregressive decoder since T5 is on the docket. This shouldn't be too much of a hassle since the Transformer model works, but leaving as...
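For reference, the core of autoregressive decoding is just a greedy loop that feeds the growing prefix back in. A minimal sketch, where `decoder_step`, `BOS`, and `EOS` are hypothetical placeholders for whatever the real decoder exposes:

```python
# Hedged sketch of greedy autoregressive decoding; `decoder_step` is assumed
# to map (encoder_out, prefix) -> logits of shape (batch, time, vocab).
import torch

BOS, EOS, MAX_LENGTH = 1, 2, 128


def greedy_decode(decoder_step, encoder_out: torch.Tensor) -> list[int]:
    prefix = [BOS]
    for _ in range(MAX_LENGTH):
        logits = decoder_step(encoder_out, torch.tensor([prefix]))
        symbol = int(logits[0, -1].argmax())
        if symbol == EOS:
            break
        prefix.append(symbol)
    return prefix[1:]  # Strips BOS.
```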
TorchMetrics support is pretty reliable nowadays and makes distributed training less annoying (no more world sizes, yay!). It also syncs well with wandb logging and allows monitoring of training batch...
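The win is that the metric object owns the cross-process synchronization, so the module never touches world sizes. A hedged sketch of the usual Lightning pattern (the module body and `num_classes` are assumptions, not yoyodyne code):

```python
# Hedged sketch: TorchMetrics inside a LightningModule.
import pytorch_lightning as pl
import torch
import torchmetrics


class Seq2SeqModule(pl.LightningModule):
    def __init__(self, num_classes: int = 100):
        super().__init__()
        self.accuracy = torchmetrics.Accuracy(
            task="multiclass", num_classes=num_classes
        )

    def validation_step(self, batch, batch_idx):
        logits, target = self(batch)  # Assumed forward signature.
        self.accuracy(logits, target)
        # Passing the metric object to self.log lets Lightning handle
        # the distributed reduction and epoch-level accumulation.
        self.log("val_accuracy", self.accuracy, on_epoch=True)
```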
Traceback (most recent call last):
  File "/usr/local/lib/python3.10/dist-packages/pytorch_lightning/trainer/call.py", line 38, in _call_and_handle_interrupt
    return trainer_fn(*args, **kwargs)
  File "/usr/local/lib/python3.10/dist-packages/pytorch_lightning/trainer/trainer.py", line 650, in _fit_impl
    self._run(model, ckpt_path=self.ckpt_path)
  File "/usr/local/lib/python3.10/dist-packages/pytorch_lightning/trainer/trainer.py", line 1112, in _run
    results =...
For models where the features are concatenated to the source string, we now handle this in the collator. We simply add the source_token vocabulary length to each feature index in...
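A minimal sketch of that offsetting, with an assumed batch layout and a hypothetical `collate_fn` (not yoyodyne's actual collator):

```python
# Hedged sketch: shifting feature indices past the source vocabulary so a
# single embedding table can serve both index spaces without collisions.
import torch

SOURCE_VOCAB_SIZE = 50  # Hypothetical size of the source token vocabulary.


def collate_fn(source: torch.Tensor, features: torch.Tensor) -> torch.Tensor:
    shifted = features + SOURCE_VOCAB_SIZE
    return torch.cat([source, shifted], dim=1)


batch = collate_fn(torch.tensor([[3, 7, 9]]), torch.tensor([[0, 2]]))
# -> tensor([[ 3,  7,  9, 50, 52]])
```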
Transformer inference (i.e., with no teacher forcing) is slow. In practice I think people typically implement some kind of caching so that at each timestep, we do not need to...
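One common form of this is key/value caching: keep the keys and values from earlier timesteps and only run the newest position as a query. A hedged, single-layer sketch (real Transformer decoders cache per layer, and every name here is illustrative, not a yoyodyne internal):

```python
# Hedged sketch of key/value caching for incremental self-attention decoding.
import torch


class CachedSelfAttention(torch.nn.Module):
    def __init__(self, d_model: int, num_heads: int):
        super().__init__()
        self.attention = torch.nn.MultiheadAttention(
            d_model, num_heads, batch_first=True
        )
        self.keys = None  # Cached keys from earlier timesteps.
        self.values = None  # Cached values from earlier timesteps.

    def forward(self, new_step: torch.Tensor) -> torch.Tensor:
        # new_step: (batch, 1, d_model), the newest decoder position only.
        if self.keys is None:
            self.keys = new_step
            self.values = new_step
        else:
            self.keys = torch.cat([self.keys, new_step], dim=1)
            self.values = torch.cat([self.values, new_step], dim=1)
        # Only the newest position is used as the query, so attention
        # outputs for earlier timesteps are never recomputed.
        output, _ = self.attention(new_step, self.keys, self.values)
        return output
```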
I am working on a simple wrapper class which loads the model and yields predictions, but the interface in `predict.py` is somewhat unfriendly for this... it has nice abstractions but they're...
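The shape of the wrapper I have in mind is roughly the following; `Predictor` and the inference call are hypothetical stand-ins, not the actual yoyodyne interface:

```python
# Hedged sketch of a prediction wrapper around a Lightning checkpoint.
import pytorch_lightning as pl
import torch


class Predictor:
    """Loads a trained checkpoint and returns predictions for a batch."""

    def __init__(self, model_class: type[pl.LightningModule], checkpoint: str):
        self.model = model_class.load_from_checkpoint(checkpoint)
        self.model.eval()

    def __call__(self, batch):
        with torch.no_grad():
            # predict_step is the standard Lightning inference hook; what
            # it returns is model-specific.
            return self.model.predict_step(batch, batch_idx=0)
```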
With the decoupling of encoders and decoders, we have added a `Linear` encoder, which seems to just embed the inputs and pass them along. We should also add a `SelfAttention`...
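A hedged sketch of what such a `SelfAttention` encoder might look like, with assumed hyperparameters and module structure:

```python
# Hedged sketch: a single multi-head self-attention layer over the
# embedded source; not an actual yoyodyne module.
import torch


class SelfAttentionEncoder(torch.nn.Module):
    def __init__(self, vocab_size: int, d_model: int, num_heads: int = 4):
        super().__init__()
        self.embedding = torch.nn.Embedding(vocab_size, d_model)
        self.attention = torch.nn.MultiheadAttention(
            d_model, num_heads, batch_first=True
        )

    def forward(self, source: torch.Tensor) -> torch.Tensor:
        # source: (batch, length) integer symbol indices.
        embedded = self.embedding(source)
        # Each position attends over the whole source sequence.
        output, _ = self.attention(embedded, embedded, embedded)
        return output
```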
This is a notice of my plans to move the encoding methods (which take strings and make tensors) and decoding methods (which convert tensors back into strings) into the Index...
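A minimal sketch of an `Index` that owns both directions of the mapping; the method names (`encode`, `decode`) are assumptions about the planned design, not the current code:

```python
# Hedged sketch: an Index holding both string->tensor and tensor->string maps.
import torch


class Index:
    def __init__(self, symbols: list[str]):
        self._symbols = list(symbols)
        self._indices = {s: i for i, s in enumerate(self._symbols)}

    def encode(self, symbols: list[str]) -> torch.Tensor:
        """Symbols -> tensor of symbol indices."""
        return torch.tensor([self._indices[s] for s in symbols])

    def decode(self, tensor: torch.Tensor) -> list[str]:
        """Tensor of symbol indices -> symbols."""
        return [self._symbols[int(i)] for i in tensor]


index = Index(["a", "b", "c"])
assert index.decode(index.encode(["b", "a"])) == ["b", "a"]
```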