György Orosz

Results 2 comments of György Orosz

Also, I have doubts whether it is possible enumerate all the stopwords in an agglutiantive language like Finnish (or Hungarian). Did you consider using `spacy`?

Thanks for this detailed report! The tokenizer is implemented in spacy's codebase, so we'll need to send a PR over there which could take some time. Please bear with us...