comparator icon indicating copy to clipboard operation
comparator copied to clipboard

Token and q-gram based measures

Open ngmarchant opened this issue 3 years ago • 0 comments

Consider adding support for token-based comparators. After mapping strings to token sets, the similarity of the sets can be measured using:

  • Cosine similarity
  • Sørensen–Dice coefficient
  • Jaccard index
  • Tversky index

ngmarchant avatar Dec 06 '20 09:12 ngmarchant