Matt Watson

Results 339 comments of Matt Watson

These are things we would like to have, but are not things we will work on right now. Before this, we need to figure out our desired story for pretraining...

I haven't had time to read this paper yet, but open question for me... Do we need the ability to map from "token index spans" to "source text spans" and...

I've actually noticed this too for some docs symbols we are about to push out. It seems to happen always after the 10th `>>>` style code block. I think this...

Assigning myself to take a closer look.

Some other notes: - If the `vocabulary_size` argument is passed, calling `layer.vocabulary_size()` should always match what was passed. - If the vocabulary file is shorted that the forced vocabulary size,...

@blackhat-coder Any updates on this? This would actually be a useful hyperparmeter to tune in our first [guide](https://keras.io/guides/keras_nlp/transformer_pretraining/) that could help reduce training time.

Thank you! Let me know if there are any question I can help with.

Check out the environment and test running sections of our contributing guide. https://github.com/keras-team/keras-nlp/blob/master/CONTRIBUTING.md#setting-up-an-environment If something is broken or unclear there, let us know!