Drastic
Results
2
comments of
Drastic
Hi, @andrelmfarias It doesn't seems to be a problem as Ukrainian words are whitespaced too. It's just different alphabet (cyryllic) as you've pointed correctly. And 'bert-base-multilingual-cased' includes it. I even...
Thank you! Looking into it. **Update:** @andrelmfarias you gave a good point. Seems like a major tokenization issue with fine-tuning multilingual BERT model. Additional discussion can be found in [https://github.com/huggingface/transformers/issues/982](https://github.com/huggingface/transformers/issues/982)....