György Orosz
Results: 2 comments of György Orosz
Also, I doubt whether it is possible to enumerate all the stopwords in an agglutinative language like Finnish (or Hungarian). Did you consider using `spacy`?
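To illustrate the concern, here is a minimal sketch in plain Python (it does not use `spacy`'s API; the `STOPWORDS` set, the `remove_stopwords` helper, and the token list are hypothetical and chosen only for illustration). It shows how a list containing only base forms misses case-inflected variants of the same Hungarian word:

```python
# Hypothetical, hand-picked list: only the base forms of two Hungarian function words.
STOPWORDS = {"ez", "az"}


def remove_stopwords(tokens):
    """Drop tokens that exactly match an entry in STOPWORDS (case-insensitive)."""
    return [t for t in tokens if t.lower() not in STOPWORDS]


# "ezt", "ebben" and "ezzel" are case-inflected forms of "ez" ("this").
tokens = ["ez", "ezt", "ebben", "ezzel", "ház"]
kept = remove_stopwords(tokens)
print(kept)
```

Only the exact base form is filtered out; every inflected form would need its own entry, which is why lemma-based or morphology-aware filtering may be a better fit than a fixed list.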
Thanks for this detailed report! The tokenizer is implemented in spacy's codebase, so we'll need to send a PR over there which could take some time. Please bear with us...