fuzzyjoin icon indicating copy to clipboard operation
fuzzyjoin copied to clipboard

Join tables together on inexact matching

Results 43 fuzzyjoin issues
Sort by recently updated
recently updated
newest added

It seems if we use a list of match_fun's for each matching variable, it is not possible to pass arguments into them? https://stackoverflow.com/questions/44383510/r-passing-arguments-into-the-match-fun-function-in-fuzzyjoinfuzzy-join

I was trying to match dates to intervals, first as a toy example with integers, then with dates. Using integer ranges instead of date intervals requires me to make use...

Is it possible to allow `max_dist` to be a vector so that it can be different for each column passed to `stringdist_join`'s `by` argument? Something like this: `stringdist_inner_join(df1, df2, by=c("col1",...

Here is short vignette in response to your [call](https://github.com/dgrtwo/fuzzyjoin#future-work), showing a use that seems in demand, but not easily available elsewhere, cf. https://github.com/hadley/dplyr/issues/557 and http://stackoverflow.com/q/41132081/1036500. Let me know what you...

I'm experimenting with matching along n variables (ex `x1` and `x2`) and want to keep track of the distance for each variable (`distance_col = "distance"`). You can do this, but...

Hi David, Thank you for all you have done for the R community. This is just to provide you with an example case of using fuzzy_left_join() in order to add...

I have two data frames. I need to merge them based on a partial string match. Data frame A has Gene.Name column with EHBP1. Data frame B has Gene.Symbols column...

I've refactored the entire package so I could get a better understanding of what it does since there was a lot of nested conditions, nested function definitions, and different types...

Thank you for this package! I used it for assigning people to generations (Boomers, Millenials, etc) depending on the year they were born (also possible to base it on age...

The [docs](https://www.rdocumentation.org/packages/fuzzyjoin/versions/0.1.6/topics/stringdist_join) state that > If method = "soundex", the max_dist is automatically set to 0.5, since soundex returns either a 0 (match) or a 1 (no match). And that's...