Jules Gagnon-Marchand
Added a test to make sure that there is still text to work on; otherwise it would crash.
Passes all tests. Pull request at https://github.com/ShailChoksi/text2digits/pull/48
`bad_words_ids` accepts n-grams; you could have just tokenized your rejected word list.
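A minimal sketch of what that would look like. This deliberately avoids a real tokenizer: `toy_tokenize` is a hypothetical stand-in (a real version would call an HF tokenizer with `add_special_tokens=False`), but it shows the shape `bad_words_ids` expects, where multi-token phrases become n-grams:

```python
# Hypothetical stand-in for a real tokenizer: maps each word to a stable
# integer id, so multi-word phrases yield multi-token n-grams.
VOCAB = {}

def toy_tokenize(phrase):
    return [VOCAB.setdefault(word, len(VOCAB)) for word in phrase.split()]

rejected = ["bad phrase", "forbidden"]
# A list of token-id lists: single words give length-1 lists,
# multi-word phrases give longer ones (the n-gram case).
bad_words_ids = [toy_tokenize(phrase) for phrase in rejected]
# → [[0, 1], [2]]
```

With a real HF tokenizer the list comprehension stays the same; only the tokenize call changes.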
Did you look into using `transformers.Constraint`?
Yes, people usually seem to just use different heads.
I'm trying to get `google/flan-t5-xxl` to run on a single A100 80GB GPU for the seq2seq policy. Is there already a way to set the precision to bfloat16? (I don't see...
Enabling offloading of a model from GPU memory to CPU memory when it's not in use would likely help too.
@gabrielhuang have you started doing work like this? (I'm also at Mila)
This is my current approach, indeed: just allowing the user to pass kwargs for `from_pretrained` and `Linear`. Passing `torch_dtype` to `from_pretrained` and `dtype` to `Linear` works. I suppose adding amp...
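The kwargs-forwarding idea can be sketched like this. Everything here is hypothetical scaffolding, not an existing API: `loader` and `head_cls` stand in for something like `from_pretrained` and `torch.nn.Linear`, and the caller supplies the dtype kwargs:

```python
# Sketch: forward user-supplied kwargs to the model loader and the head
# constructor, so precision (e.g. bfloat16) is the caller's choice.
def build_policy(loader, head_cls, loader_kwargs=None, head_kwargs=None):
    # e.g. loader_kwargs={"torch_dtype": torch.bfloat16},
    #      head_kwargs={"dtype": torch.bfloat16}
    model = loader(**(loader_kwargs or {}))
    head = head_cls(**(head_kwargs or {}))
    return model, head
```

The design choice is just that the wrapper stays dtype-agnostic: it never hardcodes a precision, so the same code path covers fp32, fp16, and bf16.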
Looks like Stable Baselines3 doesn't support bfloat16, because of all the `a_tensor_name.cpu().numpy()` calls. Indeed, doing that with a `bfloat16` tensor raises an exception, because torch tries to build...