Salomon

Results 34 issues of Salomon

When assigning final ids, use a user-provided threshold. Better yet, allow the user to pass multiple thresholds, and create either multiple unique_map tables or a longer form unique_map that also...
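A minimal sketch of the long-form variant, under several assumptions that the issue does not state: pairwise match scores are available as `(id_a, id_b, score)` tuples, final ids are assigned by connected components over pairs scoring at or above the threshold, and each output row is `(threshold, record_id, unique_id)`. The function names are hypothetical and are not part of any existing API.

```python
def assign_unique_ids(record_ids, scored_pairs, threshold):
    """Map each record id to a cluster id using union-find.

    Assumption: two records share a final id when they are linked by a
    chain of pairs whose score is >= threshold.
    """
    parent = {rid: rid for rid in record_ids}

    def find(x):
        # Walk to the root, shortening the path along the way.
        while parent[x] != x:
            parent[x] = parent[parent[x]]
            x = parent[x]
        return x

    for id_a, id_b, score in scored_pairs:
        if score >= threshold:
            root_a, root_b = find(id_a), find(id_b)
            if root_a != root_b:
                # Keep the smaller root so the cluster id is deterministic.
                parent[max(root_a, root_b)] = min(root_a, root_b)

    return {rid: find(rid) for rid in record_ids}


def long_form_unique_map(record_ids, scored_pairs, thresholds):
    """Return rows of (threshold, record_id, unique_id), one set per threshold."""
    rows = []
    for threshold in sorted(thresholds):
        mapping = assign_unique_ids(record_ids, scored_pairs, threshold)
        for rid in sorted(mapping):
            rows.append((threshold, rid, mapping[rid]))
    return rows


if __name__ == "__main__":
    pairs = [(1, 2, 0.9), (2, 3, 0.6)]
    for row in long_form_unique_map([1, 2, 3], pairs, [0.5, 0.8]):
        print(row)
```

Storing the threshold as a column keeps everything in one table, so comparing cluster assignments across thresholds becomes a filter rather than a join across several `unique_map` tables.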

Docs should have an explainer of the outputs, including what is stored in all of the output tables.

Labeled training example pairs should be stored in a table for selection and reuse. Data stored for examples should include:
- Source
- Source ids
- Label
- Label date...
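One possible shape for such a table, sketched with SQLite. Only the four listed fields come from the issue; the table name `labeled_pairs`, the split of "source ids" into two columns, the `match`/`distinct` label values, and the helper functions are all assumptions made for illustration.

```python
import sqlite3
from datetime import date


def create_label_table(conn):
    # Hypothetical schema: one row per labeled pair of records.
    conn.execute(
        """
        CREATE TABLE IF NOT EXISTS labeled_pairs (
            source      TEXT NOT NULL,
            source_id_a TEXT NOT NULL,
            source_id_b TEXT NOT NULL,
            label       TEXT NOT NULL CHECK (label IN ('match', 'distinct')),
            label_date  TEXT NOT NULL,
            PRIMARY KEY (source, source_id_a, source_id_b)
        )
        """
    )


def store_label(conn, source, id_a, id_b, label, label_date=None):
    # Sort the ids so (a, b) and (b, a) refer to the same stored pair.
    first, second = sorted((str(id_a), str(id_b)))
    stamp = (label_date or date.today()).isoformat()
    conn.execute(
        "INSERT OR REPLACE INTO labeled_pairs VALUES (?, ?, ?, ?, ?)",
        (source, first, second, label, stamp),
    )


def load_labels(conn, source, label=None):
    # Select stored examples for reuse, optionally filtered by label.
    query = (
        "SELECT source_id_a, source_id_b, label, label_date "
        "FROM labeled_pairs WHERE source = ?"
    )
    params = [source]
    if label is not None:
        query += " AND label = ?"
        params.append(label)
    query += " ORDER BY source_id_a, source_id_b"
    return conn.execute(query, params).fetchall()
```

Relabeling a pair overwrites the earlier row because of the primary key; a design that keeps label history would need the date (or a labeler id) in the key instead.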

[This issue](https://github.com/dedupeio/dedupe/issues/538) suggests using gazetteer methods.
- Documentation: https://dedupe.io/developers/library/en/latest/API-documentation.html#gazetteer-objects
- Code: https://github.com/dedupeio/dedupe/blob/master/dedupe/api.py#L985