stract
Add more scripts to tokenizer with tests
Just as the tokenizer already supports Han, Arabic, etc., it should support more scripts, each covered by tests. I think a full list can be found here.
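To make the request concrete, here is a minimal sketch of per-script classification and splitting. It is not stract's actual tokenizer code: the `Script` enum, `script_of`, and `split_by_script` are hypothetical names chosen for illustration, and the code point ranges cover only the main Unicode block of each script (real scripts span several blocks, so a real implementation would more likely rely on the Unicode Script property).

```rust
/// Hypothetical script categories, for illustration only.
#[derive(Debug, Clone, Copy, PartialEq, Eq)]
pub enum Script {
    Latin,
    Greek,
    Cyrillic,
    Hebrew,
    Arabic,
    Devanagari,
    Thai,
    Han,
    Hangul,
    Other,
}

/// Classify a character by its main Unicode block.
/// Simplification: extension blocks and supplementary planes are ignored.
pub fn script_of(c: char) -> Script {
    match c as u32 {
        0x0041..=0x005A | 0x0061..=0x007A => Script::Latin,
        0x00C0..=0x00D6 | 0x00D8..=0x00F6 | 0x00F8..=0x024F => Script::Latin,
        0x0370..=0x03FF => Script::Greek,
        0x0400..=0x04FF => Script::Cyrillic,
        0x0590..=0x05FF => Script::Hebrew,
        0x0600..=0x06FF => Script::Arabic,
        0x0900..=0x097F => Script::Devanagari,
        0x0E00..=0x0E7F => Script::Thai,
        0x4E00..=0x9FFF => Script::Han,
        0xAC00..=0xD7A3 => Script::Hangul,
        _ => Script::Other,
    }
}

/// Split text into runs of consecutive characters that share a script.
/// Whitespace is dropped and always ends the current run.
pub fn split_by_script(text: &str) -> Vec<(Script, String)> {
    let mut runs: Vec<(Script, String)> = Vec::new();
    let mut after_space = true;
    for c in text.chars() {
        if c.is_whitespace() {
            after_space = true;
            continue;
        }
        let script = script_of(c);
        // Extend the last run only if it is adjacent and has the same script.
        let same = !after_space && runs.last().map_or(false, |(last, _)| *last == script);
        if same {
            runs.last_mut().unwrap().1.push(c);
        } else {
            runs.push((script, c.to_string()));
        }
        after_space = false;
    }
    runs
}
```

Under these assumptions, adding a script would mean adding one enum variant, one range arm, and one test case with a sample string in that script.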