stringdist
stringdist copied to clipboard
String distance functions for R
All seq_* function documentation refers to the `stringdist-metrics` documentation for usage of the `method` parameter. The documentation of `stringdist-metrics` refers only to the metric definitions when used with character input,...
Why not, a bit of user-friendlyness :-).
At the moment `stringsim` assumes that all weights are equal to 1 for edit-based distances. Although this does yield a valid maximum (weights are maximally 1), using lower weights will...
I was wondering if you've thought of including qgram filtering for edit distance in the stringdist package. Oftentimes users are only concerned with comparing strings that pass a certain similarity...
Adding support for [string kernel distances](http://www.jmlr.org/papers/volume2/lodhi02a/lodhi02a.pdf) would be nice.
At the moment all vectors are indexed with `int` in the underlying C-code. I should update this to `size_t`