Matt Watson

339 comments by Matt Watson

Just because they are exposing these parameters does not mean we need to as well. We do need compatibility between our forward pass and theirs. But we don't...

Agreed, we need to handle the differences between XLM-R base and XL (it's annoying that their architecture changes between what are supposed to be different sizes). But maybe let's start...

Triage notes: we took a look and think that we should actually have concatenate throw an error with only a single input. We will try out a change and see...
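
A minimal sketch of the kind of check being considered; the wrapper name and error message here are hypothetical, not the actual Keras change.

```python
# Hypothetical sketch of the validation being discussed; the real Keras
# implementation and error message may differ.
from keras import layers


def checked_concatenate(inputs, axis=-1):
    # Raise rather than silently passing a single tensor through.
    if not isinstance(inputs, (list, tuple)) or len(inputs) < 2:
        raise ValueError(
            "`concatenate` expects a list of at least two input tensors, "
            f"received: {inputs}"
        )
    return layers.Concatenate(axis=axis)(inputs)
```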

Looking at the code here, it seems like you have a `mask_dict` which is static in the context of an individual `model.fit()` call. Is that right? If that is the...

Took a look at the benchmark. It looks like `timeseries_dataset_from_array2`, which I'm guessing is the new version, is actually performing noticeably worse. Is that correct? We will probably not be...
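
For reference, a rough timing harness along these lines; the data and sizes are placeholders, and only the existing `keras.utils.timeseries_dataset_from_array` is timed here.

```python
# Rough benchmark sketch; swap in the PR's `timeseries_dataset_from_array2`
# (not available here) to compare it against the existing implementation.
import time

import numpy as np
from keras.utils import timeseries_dataset_from_array

data = np.random.rand(200_000, 8).astype("float32")
targets = data[:, 0]

start = time.perf_counter()
ds = timeseries_dataset_from_array(
    data, targets, sequence_length=64, batch_size=256
)
for _ in ds:  # iterate once to force the whole pipeline to run
    pass
print("existing implementation:", time.perf_counter() - start, "seconds")
```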

@AakashKumarNain sorry about the breakage here! For the original question above... would it work to replace the `token_embedding` with a [keras_nlp.layers.ReversibleEmbedding](https://keras.io/api/keras_nlp/modeling_layers/reversible_embedding/), and pass that token embedding to the masked...
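
Roughly what that replacement might look like; the vocabulary and embedding sizes below are illustrative, not taken from the original model.

```python
# Sketch of swapping in a ReversibleEmbedding and sharing it with the
# MaskedLMHead; sizes are illustrative.
import keras_nlp

VOCAB_SIZE = 30522
HIDDEN_DIM = 256

# Embeds token ids on the way in; the same weights can project hidden states
# back to vocabulary logits on the way out.
token_embedding = keras_nlp.layers.ReversibleEmbedding(VOCAB_SIZE, HIDDEN_DIM)

masked_lm_head = keras_nlp.layers.MaskedLMHead(
    vocabulary_size=VOCAB_SIZE,
    token_embedding=token_embedding,
    activation="softmax",
)
```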

For some general context, we did decide to move away from the old way of passing embedding weights to the `MaskedLMHead` because it would not save correctly with the upcoming...

Sounds good, keep us posted! And yeah, the `TokenAndPositionEmbedding` initialization is an interesting question. I don't think there is any way to guarantee stable training performance with arbitrary architectures and...

@innat can you say more about what you had in mind? Something that goes directly from `keras.ops` to the ONNX format? And how would that compare to going to ONNX through...
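
For comparison, a minimal sketch of one existing indirect route (via tf2onnx with a tf.keras model); this is an assumption about the path being compared, since the comment above is truncated, and the toy model is purely illustrative.

```python
# Sketch of an indirect route to ONNX via tf2onnx; assumes TensorFlow and
# tf2onnx are installed, and uses a toy model for illustration.
import tensorflow as tf
import tf2onnx

inputs = tf.keras.Input(shape=(8,), name="inputs")
outputs = tf.keras.layers.Dense(4)(inputs)
model = tf.keras.Model(inputs, outputs)

spec = (tf.TensorSpec((None, 8), tf.float32, name="inputs"),)
onnx_model, _ = tf2onnx.convert.from_keras(model, input_signature=spec, opset=17)
with open("model.onnx", "wb") as f:
    f.write(onnx_model.SerializeToString())
```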

What are some example inputs and outputs with your change? I would think if we wanted targets to be a sequence too, we would want to add a parameter along...
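
To make the question concrete, here is roughly what the current behavior looks like with toy data; any new parameter for sequence targets would be on top of this.

```python
# Illustrates current behavior: each window of `data` gets a single scalar
# target rather than a target sequence. Toy data for illustration only.
import numpy as np
from keras.utils import timeseries_dataset_from_array

data = np.arange(20)
targets = data + 100  # targets[i] pairs with the window starting at index i
ds = timeseries_dataset_from_array(data, targets, sequence_length=5, batch_size=4)
for x, y in ds:
    print(x.shape, y.shape)  # e.g. (4, 5) and (4,)
    break
```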