Matt Watson comments

Results 339 comments of


                                            Matt Watson

Add Token Classification, Text Summarisation, QA Examples

These are things we would like to have, but are not things we will work on right now. Before this, we need to figure out our desired story for pretraining...

BIO/IOB Tagging Text and Vice-a-Versa

I haven't had time to read this paper yet, but open question for me... Do we need the ability to map from "token index spans" to "source text spans" and...

Broken rendering output for IntegerLookup page

I've actually noticed this too for some docs symbols we are about to push out. It seems to happen always after the 10th `>>>` style code block. I think this...

Broken rendering output for IntegerLookup page

Assigning myself to take a closer look.

Add a vocabulary_size argument to WordPieceTokenizer

Some other notes: - If the `vocabulary_size` argument is passed, calling `layer.vocabulary_size()` should always match what was passed. - If the vocabulary file is shorted that the forced vocabulary size,...

Add a vocabulary_size argument to WordPieceTokenizer

Thank you!

Add a vocabulary_size argument to WordPieceTokenizer

@blackhat-coder Any updates on this? This would actually be a useful hyperparmeter to tune in our first [guide](https://keras.io/guides/keras_nlp/transformer_pretraining/) that could help reduce training time.

Add a vocabulary_size argument to WordPieceTokenizer

Thank you! Let me know if there are any question I can help with.

Add a vocabulary_size argument to WordPieceTokenizer

Check out the environment and test running sections of our contributing guide. https://github.com/keras-team/keras-nlp/blob/master/CONTRIBUTING.md#setting-up-an-environment If something is broken or unclear there, let us know!

Decoding Functions Not Working when `jit_compile = True`

Thanks for opening!