Kenneth Benoit
Kenneth Benoit
And: Should we increment this now to v4.1.0?
@koheiw can you resolve the conflict in `src/tokens_xptr.cpp` and update with the latest `master`?
@koheiw could you pls resolve the conflict in `tokens_compound.R`? I think I know the right code section but wanted to be sure.
So we would not be able to count paddings when creating a dfm? That seems like a loss. What if we want to know the total removals by document?
I think we discussed this before, and didn't implement them simply due to naming: tokens objects don't have "features". But we could define tokens methods for these, I'm fine with...
That could work well, especially with corpus_reshape, since the vector representation could then apply to any reshaped subset of a document that people wanted to define. Would we need to...
I remember this being one of the earliest problems I encountered as a **quanteda** use case, over 10 years ago. That's why the pattern exists in `corpus_segment()`, because I was...
Makes sense, but we should deprecate it rather than delete it, for a version or three.