Use SymSpell with Hunspell dictionaries?
This is not really a bug report, sorry for using the issue tracker for this, but I'd like to see if someone has worked on this. Please point me to a better place if you know one:
Has anybody worked on using SymSpell with German Hunspell dictionaries? German forms compounds, so you cannot just export a long word list from the Hunspell dictionary and use it as input for SymSpell. The Hunspell dictionary has special flags that indicate which words can be used in a compound, and these would need to be taken into account somehow.
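To make the compound-flag point concrete, here is a minimal sketch of how those flags could be read out. It assumes the default single-character flag mode and looks only at the COMPOUNDFLAG, COMPOUNDBEGIN, COMPOUNDMIDDLE and COMPOUNDEND directives of the .aff file; it ignores affix rules, FLAG long/num modes and escaped slashes. The function names are made up for illustration and are not part of SymSpell or Hunspell.

```python
from typing import Dict, Iterable, Set

# Affix-file directives that assign a compound role to a flag.
COMPOUND_DIRECTIVES = ("COMPOUNDFLAG", "COMPOUNDBEGIN", "COMPOUNDMIDDLE", "COMPOUNDEND")


def read_compound_flags(aff_lines: Iterable[str]) -> Dict[str, str]:
    """Map each compound flag to the directive that declares it (hypothetical helper)."""
    flags: Dict[str, str] = {}
    for line in aff_lines:
        parts = line.split()
        if len(parts) >= 2 and parts[0] in COMPOUND_DIRECTIVES:
            flags[parts[1]] = parts[0]
    return flags


def compound_roles(dic_lines: Iterable[str], compound_flags: Dict[str, str]) -> Dict[str, Set[str]]:
    """For every .dic word, collect the compound directives its flags match."""
    roles: Dict[str, Set[str]] = {}
    for index, line in enumerate(dic_lines):
        entry = line.strip()
        if not entry or (index == 0 and entry.isdigit()):
            continue  # skip blank lines and the leading entry count
        entry = entry.split()[0]  # assumption: anything after whitespace is morphological data
        word, _, flag_str = entry.partition("/")
        matched = {compound_flags[f] for f in flag_str if f in compound_flags}
        roles.setdefault(word, set()).update(matched)
    return roles


# Usage with made-up flags and entries (not taken from a real dictionary):
aff = ["SET UTF-8", "COMPOUNDBEGIN x", "COMPOUNDEND z"]
dic = ["3", "Haus/xz", "Tür/z", "und"]
roles = compound_roles(dic, read_compound_flags(aff))
```

A word with an empty role set would be exported as a plain entry, while the others would still need some compound-aware handling on the SymSpell side.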
You could use the German dictionary from Aspell, which is a plain word list. To add the word frequencies required by SymSpell, you can combine/intersect it with Google Books Ngram data. You can find the code needed to create such custom word frequency dictionaries here: https://github.com/wolfgarbe/SymSpell/issues/15
That being said, making SymSpell compatible with Hunspell dictionaries would certainly be useful. I will add this to the todo list.
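The word list plus Ngram route could be sketched roughly as follows. This is not the code from the linked issue. It assumes the 1-gram files use tab-separated lines of the form ngram, year, match_count, volume_count, and that SymSpell is loaded with a "term count" line per word; please check both against the data and SymSpell version you actually use. The function name is made up for illustration.

```python
from collections import Counter
from typing import Iterable, List, Set


def build_frequency_lines(words: Set[str], ngram_lines: Iterable[str]) -> List[str]:
    """Sum yearly match counts for terms that are in the word list (hypothetical helper)."""
    counts: Counter = Counter()
    for line in ngram_lines:
        fields = line.rstrip("\n").split("\t")
        if len(fields) < 3:
            continue  # not in the assumed ngram/year/match_count layout
        term, match_count = fields[0], fields[2]
        if term in words and match_count.isdigit():
            counts[term] += int(match_count)
    # One "term count" line per word, most frequent first.
    return [f"{term} {count}" for term, count in counts.most_common()]


# Usage with made-up sample data:
word_list = {"Haus", "Tür"}
ngrams = [
    "Haus\t1990\t10\t5",
    "Haus\t1991\t7\t3",
    "Tür\t1990\t4\t2",
    "Hauss\t1990\t1\t1",  # not in the word list, so it is dropped
]
frequency_lines = build_frequency_lines(word_list, ngrams)
```

Words from the list that never occur in the Ngram data are dropped here; whether to keep them with a small default count is a design choice.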
Please add suffix and prefix support like Hunspell. It will be very useful. Hunspell dictionaries are pretty good and up to date.
Wolf, should we expect any update regarding Hunspell dictionaries anytime soon? Thank you for your excellent work so far.