Arthur
Hey! Yes, as mentioned before, the position IDs in GPT2 are not created on the fly, unlike in other models of ours. A fix is in the making, see #21853,...
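For context, here is a minimal sketch of what creating position IDs "on the fly" from an attention mask could look like for a left-padded batch. It is plain Python for illustration only, not the code from the linked PR; `position_ids_from_mask` is a hypothetical helper, and giving padded slots the value 0 is an assumption.

```python
def position_ids_from_mask(attention_mask):
    """Return 0-based position IDs per row, counting only real (unmasked) tokens.

    Hypothetical helper for illustration; not a transformers API.
    """
    all_ids = []
    for row in attention_mask:
        ids, seen = [], 0
        for is_real in row:
            if is_real:
                ids.append(seen)  # position among the real tokens only
                seen += 1
            else:
                ids.append(0)  # assumed placeholder; padded slots are masked anyway
        all_ids.append(ids)
    return all_ids


# Two sequences, the first one left-padded with two pad tokens.
mask = [
    [0, 0, 1, 1, 1],
    [1, 1, 1, 1, 1],
]
position_ids = position_ids_from_mask(mask)
print(position_ids)
```

Without something like this, a left-padded sequence would have its first real token placed at position 2 instead of 0.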
Yes! Feel free to ping me for an early review if you have any doubts
It's unrelated to this issue, but setting `device_map = "auto"` should suffice
I think you can take this on!
Yes! Static KV cache is not supported yet, but it is coming soon!
We should be able to close this as #31846 was merged
@haixpham could you provide a reproducer instead?
cc @muellerzr if you can have a look!
I am not getting this on 0.19:
This is expected: the `pre_tokenizer` does not have access to the information about the special tokens, so it will always prepend regardless of whether the first token is a special...
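To illustrate the behaviour, here is a toy sketch in plain Python. It does not use the real `tokenizers` API: `toy_pre_tokenize` and `SPECIAL_TOKENS` are hypothetical names, and the `▁` marker assumes a Metaspace-style pre-tokenizer.

```python
# Hypothetical set of special tokens. The toy pre-tokenizer below never
# receives it, which is the point being illustrated.
SPECIAL_TOKENS = {"<s>", "</s>"}


def toy_pre_tokenize(text, replacement="▁"):
    """Replace spaces with the marker and always prepend one.

    Illustration only: the function sees raw text and has no notion of
    which substrings are special tokens.
    """
    return replacement + text.replace(" ", replacement)


plain = toy_pre_tokenize("Hello world")
with_special = toy_pre_tokenize("<s>Hello world")
print(plain)
print(with_special)
```

In both cases the marker ends up at the very start of the string, including in front of `<s>`.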