Mark van der Loo
Mark van der Loo
The title says it all. I may do a PR if get round to it.
[Brody Gaslam](https://cran.r-project.org/web/packages/unitizer/vignettes/u4_reproducible-tests.html) gives a nice, non-exhaustive list of side effects that may be interesting while testing. 1. Workspace / Evaluation Environment 2. Random seed 3. ~Working directory~ 4. Search path...
Errors thrown by user code are not caught by `tinytest`, but we could let R print the stack trace by default.
This function is called for its side effect of registering functions from an extension package. However this only occurs when run from `run_test_file`, or `source` in interactive mode, but not...
Now that the `C` functions have been exported, and CRAN is more strict about macro's like `INTEGER()` it would be good to have input type checking/conversion of `SEXP` objects at...
Suggested by Tom Magerman by e-mail to add - [Dice Coefficient](https://en.wikipedia.org/wiki/S%C3%B8rensen%E2%80%93Dice_coefficient) - [Sorensen overlap](https://en.wikipedia.org/wiki/Overlap_coefficient) to the q-gram distances
For example ``` stringdist("hello","world",method="cosine", q=1:2) ``` would yield the cosine distance over the concatenation of 1-gram and 2-gram profiles. This would also enhance compatibility, e.g. with the `textcat` package.
Why not, a bit of user-friendlyness :-).
At the moment `stringsim` assumes that all weights are equal to 1 for edit-based distances. Although this does yield a valid maximum (weights are maximally 1), using lower weights will...