João Lages

Results: 61 comments by João Lages

@maciejkula yeah, that's it. Each time step `i` could have more than one item associated with it. This would be very useful in cases where 'the order of the items...

Ok, I'm starting to understand. I had missed [this padding](https://github.com/maciejkula/spotlight/blob/fa3655996dccdd33b419cef86b4e95da5f1196c0/spotlight/sequence/representations.py#L219) that you apply to the input. You mask the input so that the task becomes something like 'predicting the last...
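
Not spotlight's actual code, just a minimal sketch of what that shift-by-one padding effectively does, assuming `0` is the padding id and using a toy item sequence:

```python
import torch
import torch.nn.functional as F

# Hypothetical toy sequence of item ids (0 reserved for padding).
items = torch.tensor([[3, 7, 2, 9]])             # shape: (batch, seq_len)

# Prepend a padding token and drop the last element: the representation
# at position i is then built only from items before i, so predicting
# the item at position i never sees that item itself.
inputs = F.pad(items, (1, 0), value=0)[:, :-1]   # [[0, 3, 7, 2]]
targets = items                                  # [[3, 7, 2, 9]]

print(inputs, targets)
```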

When predicting, it would make sense to me to use the whole, non-padded input, and then the whole output afterwards, not only the portion of it related to the...

Ah ok, I think I finally understood. Only `user_representations` will contain vectors that try to become as close as possible to the input vectors after passing through the embedding layer. This means...

It'd be cool if we could have a custom hidden layer with that behavior to add more non-linearities and transformations to the model
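
Something along these lines is what I have in mind, purely a sketch and not part of spotlight; `NonLinearHead` and its dimensions are made up:

```python
import torch
import torch.nn as nn

class NonLinearHead(nn.Module):
    """Hypothetical extra hidden layer applied on top of the sequence (user)
    representations before they are scored against the item embeddings."""

    def __init__(self, embedding_dim: int, hidden_dim: int):
        super().__init__()
        self.net = nn.Sequential(
            nn.Linear(embedding_dim, hidden_dim),
            nn.ReLU(),                              # extra non-linearity
            nn.Linear(hidden_dim, embedding_dim),   # map back to item space
        )

    def forward(self, user_representations: torch.Tensor) -> torch.Tensor:
        # user_representations: (batch, seq_len, embedding_dim)
        return self.net(user_representations)
```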

What's the big advantage over training only one time step at a time? That way, each `i` would get a single backprop pass.

From what I understood, in the GPT-2 experiment you only changed a [single Conv1D layer](https://github.com/microsoft/LoRA/blob/aa68d8a021c7ba08973e35fdfdc76338fdbfad57/examples/NLG/src/model.py#L95), right? That makes more sense in terms of training speed.
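
For reference (not the repo's actual classes), a minimal sketch of what adapting a single projection with LoRA boils down to; `LoRALinear`, `r`, and `alpha` here are illustrative:

```python
import torch
import torch.nn as nn

class LoRALinear(nn.Module):
    """Low-rank adapter around a frozen linear layer:
    y = W x + (alpha / r) * B(A(x)), with only A and B trained."""

    def __init__(self, in_features: int, out_features: int, r: int = 4, alpha: float = 32.0):
        super().__init__()
        self.base = nn.Linear(in_features, out_features)
        self.base.weight.requires_grad_(False)   # frozen pretrained weight
        self.base.bias.requires_grad_(False)
        self.lora_A = nn.Linear(in_features, r, bias=False)
        self.lora_B = nn.Linear(r, out_features, bias=False)
        nn.init.zeros_(self.lora_B.weight)       # update starts at zero
        self.scaling = alpha / r

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        return self.base(x) + self.scaling * self.lora_B(self.lora_A(x))
```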

Not really. One would need to dig deeply into the code. Most of these models preprocess the SPIDER dataset beforehand and then use another script to train the model on...

Thanks for your reply. That is what I am doing. Nevertheless, it seems that with 2 optimizers the loss decreases much faster than with one optimizer. What might...

Never mind, I had a typo; 2 optimizers vs 1 optimizer produce more or less the same result, it seems. Still having the loss problem, though.
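
For context, a toy sketch of the comparison I mean (hypothetical model and learning rates): two optimizers over disjoint parameter groups versus one optimizer with per-group settings, which should apply the same updates when the hyperparameters match.

```python
import torch
import torch.nn as nn

model = nn.Sequential(nn.Linear(10, 32), nn.ReLU(), nn.Linear(32, 2))  # toy model
encoder, head = model[0], model[2]

# Option A: two separate optimizers, stepped one after the other.
opt_enc = torch.optim.Adam(encoder.parameters(), lr=1e-3)
opt_head = torch.optim.Adam(head.parameters(), lr=1e-2)

# Option B: one optimizer with per-group learning rates.
opt_single = torch.optim.Adam([
    {"params": encoder.parameters(), "lr": 1e-3},
    {"params": head.parameters(), "lr": 1e-2},
])

x, y = torch.randn(8, 10), torch.randint(0, 2, (8,))
loss = nn.functional.cross_entropy(model(x), y)
loss.backward()

opt_enc.step(); opt_head.step()   # option A
# opt_single.step()               # option B (use one setup or the other, not both)
```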