fuzzyjoin
fuzzyjoin copied to clipboard
Join tables together on inexact matching
It seems if we use a list of match_fun's for each matching variable, it is not possible to pass arguments into them? https://stackoverflow.com/questions/44383510/r-passing-arguments-into-the-match-fun-function-in-fuzzyjoinfuzzy-join
I was trying to match dates to intervals, first as a toy example with integers, then with dates. Using integer ranges instead of date intervals requires me to make use...
Is it possible to allow `max_dist` to be a vector so that it can be different for each column passed to `stringdist_join`'s `by` argument? Something like this: `stringdist_inner_join(df1, df2, by=c("col1",...
Here is short vignette in response to your [call](https://github.com/dgrtwo/fuzzyjoin#future-work), showing a use that seems in demand, but not easily available elsewhere, cf. https://github.com/hadley/dplyr/issues/557 and http://stackoverflow.com/q/41132081/1036500. Let me know what you...
I'm experimenting with matching along n variables (ex `x1` and `x2`) and want to keep track of the distance for each variable (`distance_col = "distance"`). You can do this, but...
Hi David, Thank you for all you have done for the R community. This is just to provide you with an example case of using fuzzy_left_join() in order to add...
I have two data frames. I need to merge them based on a partial string match. Data frame A has Gene.Name column with EHBP1. Data frame B has Gene.Symbols column...
I've refactored the entire package so I could get a better understanding of what it does since there was a lot of nested conditions, nested function definitions, and different types...
Thank you for this package! I used it for assigning people to generations (Boomers, Millenials, etc) depending on the year they were born (also possible to base it on age...
The [docs](https://www.rdocumentation.org/packages/fuzzyjoin/versions/0.1.6/topics/stringdist_join) state that > If method = "soundex", the max_dist is automatically set to 0.5, since soundex returns either a 0 (match) or a 1 (no match). And that's...