mmb L

Results 16 comments of mmb L

Is this similar to https://github.com/mammothb/symspellpy/issues/92? I believe this is because bigrams are only used when a term from the input phrase is split up, e.g., when `tengkerang` is split to...

Hi @pineking I have tried input term you used with the following code ```python import pkg_resources from symspellpy.symspellpy import SymSpell sym_spell = SymSpell(max_dictionary_edit_distance=0, prefix_length=7) dictionary_path = pkg_resources.resource_filename( "symspellpy", "frequency_dictionary_en_82_765.txt" )...

This is currently intentional as enabling `ignore_token` in `lookup_compound` may require significant change to the code. As I am not the original creator of the algorithm, I am afraid I...

Do you have a sample code snippet which can show the error? And also, may I know how did you obtain "frequency_dictionary_en_82_765.txt" file, i.e., simply download or copy/paste into a...

> but I was using https://github.com/wolfgarbe/SymSpell/blob/master/SymSpell/frequency_dictionary_en_82_765.txt The dictionary file from the original SymSpell repository is saved with the UTF-8-BOM encoding. And `load_dictionary()` opens the file using UTF-8 encoding by default....

@rebouvet Hi, can you upload a sample of the dictionary which causes the error so I can try and debug?

@lucaslrolim i was able to run word_segmentation without a `StopIteration` error with the following code ```python import os.path from symspellpy.symspellpy import SymSpell # Set max_dictionary_edit_distance to avoid spelling correction sym_spell...

@vection I see you're swapping out the `"frequency_dictionary_en_82_765.txt"` from the sample code with your own dictionary. However, `pkg_resources` only find the dictionaries that's shipped with the `symspellpy` packages. As `"fr-100k.txt"`...

@alexvaca0 @rebouvet May I know if you have a similar problem with the dictionary path not pointing to the right location? Similar to what I have described in https://github.com/mammothb/symspellpy/issues/79#issuecomment-731492752

@potipot I believe the error is due to the assumption that the default edit distance algorithm used is `DAMERAU_OSA` (following the implementation of the [original project](https://github.com/wolfgarbe/SymSpell)). I have checked on...