Wolf Garbe

Results 61 comments of Wolf Garbe

It seems like somebody did indeed benchmark SymSpell vs. Lucene's spelling correction: ![image](https://user-images.githubusercontent.com/7057583/151679344-59a7b670-46e7-4873-941d-a597e2d1f387.png) Source: https://github.com/Shivakumar-Narayanan/Spell-Check https://docs.google.com/document/d/1QQROh8ndwBHbPwx2t1kKZcquHDDfF0Pz

SymSpell uses UTF-8, and UTF-8 supports accented characters like å. If your text files are in a different encoding you could alway convert the text files to UTF-8 before consuming...

I see your point. But both Javascript and Python don't support the interface concept, while they are the most actively used ports of SymSpell. And even in the other languages...

Thx. Typo will be fixed.

1. On which Operating System you are testing? If anything other than Windows try to comment out the lines 53...55 (Console resize is probably not supported on your platform): /*...

SymSpell.LookupCompound should do exactly this. It uses the optional bigram dictionary (load with symSpell.LoadBigramDictionary) in order to use sentence level context information for selecting the best spelling correction for multiple...

If you attach the French frequency dictionary and the bigram dictionary files to the issue in plain text format, I could have a look what goes wrong (in SymSpell and/or...

> When all will be fixed I will replicate this for the other languages of the google n-grams and I will give you the files so that this framework can...

> I will maybe add a phonetic or POS layer. Do you plan to have this kind of improvements ? Implementing a **weighted edit distance** giving a higher rank to...

Unfortunately, I have not yet found the time, but it is still on my mind.