abydos
abydos copied to clipboard
Edit distances should return their alignments
Candidates:
- [x] Levenshtein/OSA
- [ ] Damerau-Levenshtein
- [x] Yujian-Bo
- [ ] Higuera-Mico
- [x] Indel
- [ ] SAPS
- [ ] Meta-Levenshtein
- [ ] FlexMetric
- [ ] BI-SIM
- [ ] BlockLevenstein
- [ ] Rees-Levenshtein
- [x] Phonetic Edit Distance
- [x] Discounted Levenshtein
- [ ] ShapiraStorerI
Damerau-Levenshtein should be high priority. Without it, Soft Jaccard can't be implemented with its default parameters, which include using DL as the alignment algorithm/distance measure
Alignments really need to be done by creating a secondary matrix to track the tracebacks, as in PhoneticEditDistance & DiscountedLevenshtein. The greedy method won't always get the correct alignment.