Ettore Aquino

3 comments by Ettore Aquino

Any updates on this issue? I'm seeing the exact same behavior as reported by @imperialguy when creating a Lambda Layer running on the AWS Lambda Python 3.8 runtime.

Indeed. It seems that `dic.filter_extremes(keep_n=max_tokens)` provides functionality similar to `preprocess_outliers()`, so even if the `preprocess_outliers()` filter is behaving as expected (which I believe it is), once the `filter_extremes()`...

@stijnh, can you assign this issue to me? I'll look into it and try to improve the tests for `build_corpus`.