tokenizers
Fix decode
This reverts the previous breaking change.
Also adds a new ByteLevel normalizer, which replaces the ByteLevel pre_tokenizer.
Checked that we can add Chinese / Cyrillic tokens, which are properly encoded and decoded.
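
For reviewers who want to see why byte-level handling matters for non-ASCII added tokens, here is a minimal standalone sketch of the GPT-2 style byte-to-unicode mapping. It is not the code in this PR and does not call the tokenizers library; it assumes the ByteLevel components follow that well-known mapping, and the helper names (`bytes_to_unicode`, `byte_level_encode`, `byte_level_decode`) are only illustrative.

```python
def bytes_to_unicode():
    """Map each of the 256 byte values to a printable unicode character.

    Printable latin-1 bytes map to themselves; the remaining bytes are
    shifted to code points starting at 256 (GPT-2 style mapping).
    """
    bs = (
        list(range(ord("!"), ord("~") + 1))
        + list(range(ord("¡"), ord("¬") + 1))
        + list(range(ord("®"), ord("ÿ") + 1))
    )
    cs = bs[:]
    n = 0
    for b in range(256):
        if b not in bs:
            bs.append(b)
            cs.append(256 + n)
            n += 1
    return dict(zip(bs, map(chr, cs)))


BYTE_TO_CHAR = bytes_to_unicode()
CHAR_TO_BYTE = {c: b for b, c in BYTE_TO_CHAR.items()}


def byte_level_encode(text):
    # Illustrative helper: UTF-8 encode, then map every byte to its stand-in char.
    return "".join(BYTE_TO_CHAR[b] for b in text.encode("utf-8"))


def byte_level_decode(mapped):
    # Illustrative helper: invert the mapping and decode the bytes as UTF-8.
    return bytes(CHAR_TO_BYTE[c] for c in mapped).decode("utf-8")


# Round-trip check on Chinese and Cyrillic samples.
for sample in ["你好", "Привет", "hello world"]:
    assert byte_level_decode(byte_level_encode(sample)) == sample
```

Under this mapping a multi-byte character becomes several stand-in characters, so a Chinese or Cyrillic token only round-trips if the same mapping is applied on encode and inverted on decode.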
Fixes #1392