Kenneth Benoit
I encountered this character when working with some Twitter data: [NARROW NO-BREAK SPACE](https://www.fileformat.info/info/unicode/char/202f/index.htm). There may be better ways to work around this, but this provides a simple fix if users...
`fcm()` objects created with the default of `tri = TRUE` produce misleading feature frequencies when using `topfeatures()`, which calls the inherited method for dfm objects that simply sums the columns....
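To illustrate why column sums mislead on a triangular matrix, here is a minimal base R sketch. It does not use quanteda or reproduce its internals; the matrix `full` and its counts are invented for illustration, and `upper` simply mimics the effect of keeping only the upper triangle.

```r
# Hypothetical symmetric co-occurrence counts for three features
# (values invented for illustration, not taken from any corpus)
feats <- c("a", "b", "c")
full <- matrix(c(0, 2, 1,
                 2, 0, 3,
                 1, 3, 0),
               nrow = 3, byrow = TRUE,
               dimnames = list(feats, feats))

# Keep only the upper triangle, similar in spirit to tri = TRUE
upper <- full
upper[lower.tri(upper)] <- 0

# Column sums over the full matrix count every co-occurrence of a feature
colSums(full)
# a b c
# 3 5 4

# Column sums over the triangle drop the counts stored below the diagonal,
# so features that appear early in the ordering are undercounted
colSums(upper)
# a b c
# 0 2 4
```

In this sketch, adding the row sums to the column sums of the triangle (and subtracting the diagonal) recovers the full totals, which is one possible direction for a fix.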
``` r library("quanteda") ## Package version: 3.0.9000 ## Unicode version: 13.0 ## ICU version: 69.1 ## Parallel computing: 12 of 12 threads used. ## See https://quanteda.io for tutorials and examples....
Please use this issue to add your **testimonials**, **experiences**, **feedback**, etc. We would love to hear from you. #### Updated consent notice 2017-10-06: Note that in leaving a testimonial, you...
As implemented in `tokens_select()`, we could add this to the `window` argument in:
- [ ] `kwic()`
- [ ] `fcm()` - since `ordered = TRUE` is already like `window...
We forgot `check_dots(...)`... ```r > toks fcm(toks, not_an_argument = TRUE) Feature co-occurrence matrix of: 4 by 4 features. features features A D C E A 3 6 3 3 D...
These have changed a lot recently and I want to get my head clearly around these functions, and how we package them together and document them. I'm starting this issue...
This allows us to tag functions explicitly according to whether they are mature, developing, deprecated, defunct, etc. https://github.com/r-lib/lifecycle
The change to retain the base `character` class for a corpus implies that we should be able to let users use functions on a corpus that operate on character vectors,...
What if we want to form ngrams but then remove any ngram that contains a stopword? This currently does not work: ```r txt % featnames() # [1] "here_:" ":_is" "is_another"...
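The desired behaviour can be sketched in base R, independent of quanteda. This is only an illustration of the idea, not a proposed implementation: the token vector `toks` and the stopword list `stops` are hypothetical, and the bigrams are built with `paste()` rather than with any package function.

```r
# Hypothetical tokens and stopword list, chosen only for illustration
toks  <- c("here", "is", "another", "example", "of", "text")
stops <- c("is", "of")

# Form bigrams by pasting each token to its successor
bigrams <- paste(head(toks, -1), tail(toks, -1), sep = "_")

# Split each bigram back into its parts and flag any that contain a stopword
parts <- strsplit(bigrams, "_", fixed = TRUE)
has_stop <- vapply(parts, function(p) any(p %in% stops), logical(1))

# Keep only the bigrams with no stopword component
kept <- bigrams[!has_stop]
kept
# [1] "another_example"
```

Splitting on the concatenator assumes that no token itself contains `_`; a real implementation would need to filter on the token sequence before concatenation.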