tidytext icon indicating copy to clipboard operation
tidytext copied to clipboard

Text mining using tidy tools :sparkles::page_facing_up::sparkles:

Results 14 tidytext issues
Sort by recently updated
recently updated
newest added

Hello Julia, I am learning a lot from your book and videos. Thank you. I am conducting a text mining analysis with topic modeling. While following your instructions, I have...

Hello, Thank you for a wonderful tool. I have noticed that RAM consumption becomes the computational bottleneck when unnesting tokens from a large corpus, which gets exponentially worse as the...

Some `stop_words` do not belong to the list. For example, there are some `stop_words` that are present in sentiment lexicons: - According to `onix` lexicon "good" is a stop_word. I...

I suggest to add a function to bind BM25 score *(which is based on a probabilistic term weighting model)*. It is useful in some cases as it gives control over:...

feature

I'd love to be able to calculate lexical diversity using tidy principles. I know Quanteda and koRpus already have these functions, but I'd prefer to do it the tidy way....

feature

In the current `tidytext` document explaining about [the tidy approach to `stm` object](https://juliasilge.github.io/tidytext/reference/stm_tidiers.html#examples), there is no specific example of how to add covariates. I wanted to try that out with...

documentation

Hi Julia! I'm big fan of the tidy text mining book, but it seems it does not have too much emphasis on how to tune the number of topics (K)...

feature

Hi, I love learning tidytext but was a bit surprised to see that the get_sentiments() function does not allow to use the non-english translations included within the Nov 2017 nrc...

feature

I think it would be fairly easy to add support for the lsa package to tidytext and broom. See example below. ```r # Put some docs in a vector library("dplyr")...

feature