fuzzyjoin icon indicating copy to clipboard operation
fuzzyjoin copied to clipboard

Join tables together on inexact matching

Results 43 fuzzyjoin issues
Sort by recently updated
recently updated
newest added

Any plans to support dplyr style calls to external databases, even for some subset of functions? I'm guessing that's out of scope for this package, but it would be nice...

Is there any way to match on normalized string distance? Ie for osa, lv, and dl, dividing by the nchar of the longest word, or for lcs the additive lengths...

`geo_join` does a full _m*n_ comparison of points. This works for moderately sized datasets but becomes intractable in both time and memory. The right `geo_join()` implementation would likely use [k-d...

In this example 100-110 is not joined with 111-112, but 10-11 is joined with 11.1-11.2. ```{r} x1

In addition to providing examples of match_fun's #22 , it looks like match_fun gets used as multi_match_fun if match_fun is singular and there are multiple column's in the by argument?...

I can't find an example for how match_fun works anywhere online

I just postd [this question](http://stackoverflow.com/questions/42932160/finding-approximately-matching-rows-of-numerical-data-in-r) which contains the following example: ``` x = matrix( 1:100, nrow = 100, ncol= 4 ,byrow = FALSE) y = x + matrix( .001 *...

Similar to how [`bedtools intersect`](http://bedtools.readthedocs.io/en/latest/content/tools/intersect.html) works, it would be great if the interval/genome join methods in `fuzzyjoin` would also calculate (maybe optionally) the number of bases in the intersection (ie....

I'm trying with 'regex_join' function with sample data below. ``` library(fuzzyjoin) library(dplyr) library(readr) # Create 'user' data frame user activity %>% + regex_left_join(user, by = c("username" = "regex_name")) Source: local...