Jason S. Kessler

Results 26 comments of Jason S. Kessler

Impossible to know what's going on without the data. I'd bet you have a very low value in one which is getting marked as 0 by normcdf due to floating...

Thanks! Right now there's no way to exclude terms during corpus construction. However, after the corpus is constructed, you can easily remove outlying terms. For example: ```python # Remove bigrams...

Hi Nicky, Appreciate the feedback. Are you able to quickly build new Corpus objects when there are updates to documents? It would be somewhat straightforward to have add, say, an...

Hi Scott, Thanks for the note. Due to some expedient but poor design choices I made at the beginning of the project, you need at two categories of text to...

No On Wed, Oct 19, 2022 at 5:22 PM fatihbozdag ***@***.***> wrote: > Is there an update on this issue? Rather than two, I'd like to work on 4 >...

Hi Łukasz, Thanks so much for the PR. It would be great to handle more than bigrams. A few requests before I can merge this: * Is it possible to...

You could stop list after tokenization by running corpus.remove_terms(...). Otherwise, feel free to modify AsianNLP.py to fit your use case. It just ducktypes spaCy’s interface.

Thanks for the bug report. I just made some significant improvements to the topic modeling component in Scattertext. You can not only view documents that match an empath category, but...

That's odd, and thanks for letting me know. I'm looking into this.

Thanks for pinging this again. How do I remedy this?