Chris Little
Also, this alignment seems sub-optimal. (I think the l in Neil should be matched with an l in Niall.)

    >>> cmp.alignment('Niall', 'Neil')
    (2.526064024369237, 'N-iall', 'Neil--')
fixed alignment issue in b04ca90b
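For reference, here is a minimal sketch of the expectation raised above: a plain Needleman-Wunsch global alignment with unit scores. This is not the measure behind `cmp.alignment`, and the function name `align` and its `match`/`mismatch`/`gap` parameters are hypothetical. It only illustrates that a simple scoring scheme pairs an l in Niall with the l in Neil.

```python
def align(src, tar, match=1, mismatch=-1, gap=-1):
    """Globally align two strings (Needleman-Wunsch sketch, unit scores).

    Returns (score, aligned_src, aligned_tar), with '-' marking gaps.
    The scoring values are illustrative assumptions, not a library default.
    """
    rows, cols = len(src) + 1, len(tar) + 1
    score = [[0] * cols for _ in range(rows)]
    # First column/row: aligning a prefix against nothing costs only gaps.
    for i in range(1, rows):
        score[i][0] = i * gap
    for j in range(1, cols):
        score[0][j] = j * gap
    # Fill the table with the best of substitute, delete, insert.
    for i in range(1, rows):
        for j in range(1, cols):
            sub = match if src[i - 1] == tar[j - 1] else mismatch
            score[i][j] = max(
                score[i - 1][j - 1] + sub,
                score[i - 1][j] + gap,
                score[i][j - 1] + gap,
            )
    # Trace back from the bottom-right corner, preferring substitutions.
    i, j = len(src), len(tar)
    out_src, out_tar = [], []
    while i > 0 or j > 0:
        if i > 0 and j > 0:
            sub = match if src[i - 1] == tar[j - 1] else mismatch
            if score[i][j] == score[i - 1][j - 1] + sub:
                out_src.append(src[i - 1])
                out_tar.append(tar[j - 1])
                i, j = i - 1, j - 1
                continue
        if i > 0 and score[i][j] == score[i - 1][j] + gap:
            out_src.append(src[i - 1])
            out_tar.append('-')
            i -= 1
        else:
            out_src.append('-')
            out_tar.append(tar[j - 1])
            j -= 1
    return score[-1][-1], ''.join(reversed(out_src)), ''.join(reversed(out_tar))


best, aligned_src, aligned_tar = align('Niall', 'Neil')
print(best, aligned_src, aligned_tar)
```

With these unit scores, any optimal alignment has to match N, i, and l, so an l/l column always appears. A weighted or normalized measure can of course rank alignments differently.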
This is a result of the normalizing term in combination with the discounting function. It's worth re-examining this issue to determine if the supplied discounting functions are good, but it's...
Might it be better to remove the snowball stemmers altogether in favor of pystemmer? https://github.com/snowballstem/pystemmer
punt to 0.4 (if ever)
Ainsworth implemented in f5998314
snowball supports this stemmer as well, so if relying on that, it's unnecessary to add this
pushed back to 0.4... and may push back further; needs evaluation
~~explain badges in manual~~ (completed 435a6533c83a605205e97c50e34642e93971caf3)
bumped to 0.5 for ConfusionTable & Corpus/NGramCorpus notebooks. At a minimum, the SOTU corpus should probably be incorporated.