Emil Hvitfeldt

Results 233 issues of Emil Hvitfeldt

Part of https://github.com/tidymodels/planning/issues/29 VERY WIP

Expanding on https://github.com/tidymodels/textrecipes/pull/265 some of the functions, especially the unique variants are still more memory intensive then I would have liked. This could be fixed by writing the in C

feature

Right now we are doing `log( 1 + (N / n_j))`, but wikipedia has `log(N / (1 + n_j)) + 1` and scikit-learn does > If smooth_idf=True (the default), the...

breaking change :skull_and_crossbones:

examples here: https://textrecipes.tidymodels.org/reference/step_tfidf.html

documentation

This worked but it is hardly minimal 😓 https://github.com/tidymodels/textrecipes/pull/251/files#diff-7a4d6e75d2d9b8a28afc680e8d25135692f1aba17f9fff9cc737128ce795aff2

upkeep

Need to figure out how to apply a trained stm model to new data.

feature

Have steps being able to filter by idf values

feature

Overview of the different tokenization options, present in the package and their influence.

documentation