comparator
comparator copied to clipboard
Token and q-gram based measures
Consider adding support for token-based comparators. After mapping strings to token sets, the similarity of the sets can be measured using:
- Cosine similarity
- Sørensen–Dice coefficient
- Jaccard index
- Tversky index