SymSpell icon indicating copy to clipboard operation
SymSpell copied to clipboard

use SymSpell with hunspell dictionaries?

Open danielnaber opened this issue 7 years ago • 3 comments

This is not really a bug report, sorry for using the issue tracker for this, but I'd like to see if someone has worked on this. Please point me to a better place if you know one:

Has anybody worked on using SymSpell with German hunspell dictionaries? German uses compounds, so you cannot just export a long list from the hunspell dictionaries and use them as input for SymSpell. The hunspell dictionary has special flags that indicate which words can be used in a compound, these would need to be considered somehow.

danielnaber avatar Jul 09 '18 10:07 danielnaber

You could use the German dictionary from Aspell . This is a plain word list. In order to add the word frequencies required by SymSpell you can combine/intersect it with Google Books Ngram data . The required code to create such custom word frequency dictionaries you can find here: https://github.com/wolfgarbe/SymSpell/issues/15

That being said, making SymSpell compatible with hunspell dictionaries would be certainly useful. I will add this to the todo list.

wolfgarbe avatar Jul 09 '18 11:07 wolfgarbe

Please add suffix and prefix support like Hunspell. It will be very useful. Hunspell dictionaries are pretty good and up to date.

akaSourav avatar Dec 11 '19 02:12 akaSourav

Wolf, should we expect any update with regards to Hunspell dictionaries anytime soon? Thank you for your excellent work so far.

zephyrous avatar May 16 '20 16:05 zephyrous