Ted Enamorado
Ted Enamorado
@ajw5296 if you are using the fastLink wrapper function, it is not possible (those cutpoints are global). If anything, let us know. All my best, Ted
@ajw5296 can you provide an example of what you have in mind here? Is your question about cutoff about how we compare variables or about the weight each variable receives...
Hi @ajw5296, As @aalexandersson mentions, it is not possible to set deterministic rules based on the probability of observing a specific agreement value for field k given that a pair...
Hi @shamahutoto, As @aalexandersson mentioned, you can either block on a certain variable. Note that for all the variables that you pass to fastLink that are not listed in `stringdist.match`...
Thanks for raising this issue! The calls to `gc()` come from the original `fastLink` code written in 2015/2016. I sense that recycling did not work as well as it does...
Hi, Thanks a lot for your great feedback! We are working on a vignette for fastLink with many examples and practical advice. We will definitively incorporate the points you raise....
Hi @MAranzazuRU89, Like @aalexandersson mentions, a bit more context could be of help here. If it happens that your data allows for blocking (creating subsets of observations similar in at...
Hi @ishanaratan, If you are using a cluster computer. I would do the following: 1. Block the data. For example, if you match firms from different cities, one idea is...
Thanks so much for sharing this with us! We will try your function but feel free to make a pull request. In `blockData` we are using a function that goes...
Hi @emcghee73, Matching voter files is not an easy task, so I am with you on how complicated this can be. However, it is possible and it is just a...