Devaansh Gupta comments

Results 5 comments of


Devaansh Gupta

Misalignment between documentation and implementation of mBART50 tokenisation for the decoder

Not stale, still looking forward to a response!

Misalignment between documentation and implementation of mBART50 tokenisation for the decoder

Hey! While the output of the tokenizer is correct(both input_ids and labels in the same format), the labels are going to pass through the `shift_tokens_right` to create the `decoder_input_ids`. The...

Misalignment between documentation and implementation of mBART50 tokenisation for the decoder

I agree with @LoicGrobol. I also want to clarify this example from the [docs](https://huggingface.co/docs/transformers/model_doc/mbart#training-of-mbart50): ```python from transformers import MBartForConditionalGeneration, MBart50TokenizerFast article_hi = "संयुक्त राष्ट्र के प्रमुख का कहना है कि...

Misalignment between documentation and implementation of mBART50 tokenisation for the decoder

Got it! I guess the only change then would be in the "labels" from the tokenizer then - where there was LANGID initially, we would have a 1/-100?

Misalignment between documentation and implementation of mBART50 tokenisation for the decoder

Thank you for all the help!