spellchecker icon indicating copy to clipboard operation
spellchecker copied to clipboard

Dictionary files <pt_BR.dic> & <pt_BR.aff> not recognized.

Open leodrivera opened this issue 2 years ago • 2 comments

Description

Dictionary files <pt_BR.dic> & <pt_BR.aff>, from the LibreOffice and Chromium, are not recognized nor loaded as online dictionaries.

Reproduce

I've tried both as online and offline dictionaries:

  • On the online attempt, used the following config in SpellChecker settings, under JupyterLab.

    [
        {
            "id": "pt_BR",
            "aff": "https://raw.githubusercontent.com/LibreOffice/dictionaries/master/pt_BR/pt_BR.aff",
            "dic": "https://raw.githubusercontent.com/LibreOffice/dictionaries/master/pt_BR/pt_BR.dic",
            "name": "Português (Brasil)"
        }
    ]
    
    

    When I try to load the dict, it keeps eternally showing Loading dictionary…

    I also tested using the dict that is shown in the example, and it worked correctly:

[
    {
        "id": "en_US-online",
        "aff": "https://cdn.jsdelivr.net/codemirror.spell-checker/latest/en_US.aff",
        "dic": "https://cdn.jsdelivr.net/codemirror.spell-checker/latest/en_US.dic",
        "name": "My favorite variant of English"
    }
]
  • On the offline attempt, I followed what is described in the README After adding the these 2 files (pt_BR.aff and pt_BR.dic), no new dictionary appeared. Tested with the files from both LibreOffice and Chromium repos.

Expected behavior

That the dictionaries were loaded correctly, both by the "online" and "offline" method

Context

  • Python package version:

pip 22.2.2

  • Extension version:

Config dir: /home/leodrivera/venv/data-analise/etc/jupyter jupyter_server_mathjax enabled - Validating jupyter_server_mathjax... jupyter_server_mathjax OK jupyterlab enabled - Validating jupyterlab... jupyterlab 3.4.4 OK jupyterlab_git enabled - Validating jupyterlab_git... jupyterlab_git 0.37.1 OK jupyterlab_pullrequests enabled - Validating jupyterlab_pullrequests... jupyterlab_pullrequests 3.0.2 OK jupyterlab_spellchecker enabled - Validating jupyterlab_spellchecker... jupyterlab_spellchecker 0.7.2 OK jupyterlab_templates enabled - Validating jupyterlab_templates... jupyterlab_templates 0.3.1 OK nbclassic enabled - Validating nbclassic... nbclassic 0.4.3 OK nbdime enabled - Validating nbdime... nbdime 3.1.1 OK notebook_shim enabled - Validating notebook_shim... notebook_shim OK voila.server_extension enabled - Validating voila.server_extension... voila.server_extension OK

  • Operating System and its version: Ubuntu 20.04.4 LTS
  • Browser and its version: Google Chrome Version 104.0.5112.101 (Official Build) (64-bit)

leodrivera avatar Aug 23 '22 19:08 leodrivera

OK, that is understandable ... The reason is that the underlying spellcheck engine has a lot of problems with the aff-files in the case of roman languages with a lot of differentiations due to grammar rules. I had the same problem with the italien dictionary I found for free. The engine is simply enrolling all rules into a list of words, which is huge in your case ;-) we are looking for a better engine which also will solve some other problems, e.g. multi/mixed languages support.

ocordes avatar Aug 25 '22 08:08 ocordes

@ocordes You said on the last comment that you guys were looking after a new engine. Since it has passed almost 3 years and a lot of contributions were made since them, do we still have this issue?

leodrivera avatar Mar 19 '24 21:03 leodrivera