csvdedupe
csvdedupe copied to clipboard
A way to set the seed and/or pickup where you left off
I have a use case where I need to be able to restart the process where we left off, basically calling csvdedupe as a service to create training sets. I need to find a way that I can call csvdedupe get a pair to match against, call it again give it the answer, and have it store the settings and state. I thought I could do this by setting the random.seed value but I can't find a command line option for this.
I found https://github.com/dedupeio/dedupe/issues/541 and https://github.com/dedupeio/dedupe/issues/643 but neither has an actual solution and were closed due to inactivity.
Any ideas?
Thanks, Jeremy