Arthur
Arthur
Hey! The `tokenizer_class` that was set in the configuration.json is wrong as the `IndoNLGTokenizer` does not exist in transformers. You should try to ask the other of the model on...
@muellerzr is there a PR or Issue we can track and link here?
Thanks for raising! given that we had #27020, this should be fairly easy to fix! cc @younesbelkada
Will merge the PR today
The previous tests were passing but the tokenizer was `slow` were it should have been `fast` 😅
cc @ydshieh the failing tests are related to the length of the dictionary of the tokenizer. Spaces are encoded to `222`, which is then passed to the model, while the...
The problem is not from the Fast tokenizer (it is a GPT2 tokenizer) but the tiny config test. GPT2TokenizerFast is pretty much full proof at this points. I was just...
As @ydshieh said it might have been fixed, the problem is that the CI is still kind of stuck/not working ... not really sure if it is only this PR,...
I will run them locally 😉