vscode-cspell-dict-extensions icon indicating copy to clipboard operation
vscode-cspell-dict-extensions copied to clipboard

accented letters make errors

Open francoisromain opened this issue 6 years ago • 17 comments

Hello and thank you for this useful extension.

I installed the french dictionnary and set "cSpell.language": "en,fr" in the preferences.

It works fine but creates an error when a character is not accentuated in a french word. for exemple: prénom does not make an error, but prenom does. This is a good thing for a regular dictionnary, but not for code. In code, it is a common thing to use french words without accents.

Is there a way to fix that?

thank you

francoisromain avatar Jun 27 '18 13:06 francoisromain

I agree, this is cumberstone for multiple languages. It would be great to have option to accept words without accents.

Mangatt avatar Oct 30 '18 10:10 Mangatt

Support for this is currently in progress.

Jason3S avatar Jan 13 '20 19:01 Jason3S

In Brazilial Portuguese we have this problem too. Waiting for update to support non-acentued words, thank you.

maiconsaraiva avatar Apr 25 '20 15:04 maiconsaraiva

Was this solved?

robertotcestari avatar Jan 16 '21 18:01 robertotcestari

It is currently in progress.

Three things need to happen.

  1. The spell checker needs to be updated to use cspell@5 (In Progress)
  2. The Brazilian dictionary needs to be rebuilt. (Not yet started)
  3. This extension needs to be updated with the new Brazilian dictionary (Not yet started)

Jason3S avatar Jan 24 '21 08:01 Jason3S

Any updates on this?

izacsc avatar Jun 29 '21 23:06 izacsc

From @Jason3S list, point number 1 seems fixed in alpha version:

https://github.com/streetsidesoftware/vscode-spell-checker/pull/942 https://github.com/streetsidesoftware/vscode-spell-checker/commit/e4c2479a5e34430c597b5b076bea70a95ee9729d

So it follows that:

  • It needs to mature and reach vscode marketplace
  • Dictionaries need to be updated

👍

miquelbonastredreivip avatar Jun 30 '21 08:06 miquelbonastredreivip

I'm slowing rebuilding the dictionaries.

Done so far:

  • pt_BR
  • ru
  • de_DE
  • sv_SE
  • es_ES

Up next are French and Dutch.

Are there any preferences?

Jason3S avatar Sep 16 '21 21:09 Jason3S

It would be a nice feature, I've just give up using pt_BR dictionary because of this, @Jason3S is it possible to release the dictionaries that are ready to go and build the remaining in new issues (e.g. per dictionary)?

natenho avatar Mar 16 '22 15:03 natenho

@natenho,

They have already been released. See Spell check ignore accentuation. · Issue #1060 · streetsidesoftware/cspell

Jason3S avatar Mar 18 '22 09:03 Jason3S

@Jason3S

I tried to use the settings described and I could not get it working...here is the json settings:

 "cSpell.language": "en,pt-BR",
    "cSpell.languageSettings": [
        {
            "locale": "pt",
            "languageId": "*",
            "caseSensitive": false,
        },
        {
            "locale": "pt",
            "languageId": [
                "markdown",
                "json",
                "yaml"
            ],
            "caseSensitive": true
        }
    ]

But the accented words are still being marked as mispelled.

image

natenho avatar Mar 18 '22 13:03 natenho

@natenho,

The locales have to be the same or it won't match. pt is not the same as pt-BR.

If you set locale to be pt,pt-BR, it mean match against either pt or pt-BR.

Please try:

    "cSpell.languageSettings": [
        {
            "locale": "pt,pt-BR",
            "languageId": "*",
            "caseSensitive": false,
        },
        {
            "locale": "pt,pt-BR",
            "languageId": [
                "markdown",
                "json",
                "yaml"
            ],
            "caseSensitive": true
        }
    ]

Jason3S avatar Mar 18 '22 13:03 Jason3S

ok, now it works! thank you very much!!

I thought locale was a "contains" match, like stated in your post here:

image

natenho avatar Mar 18 '22 14:03 natenho

I can see how that is misleading.

Jason3S avatar Mar 18 '22 14:03 Jason3S

Up next are French and Dutch.

Can you please add the french version? This extension seems really great but is currently unusable in French because of the amount of accented words. Thank you for your work!

EwenQuim avatar May 02 '22 18:05 EwenQuim

@EwenQuim,

Thank you for the reminder. It has been published.

Jason3S avatar May 03 '22 14:05 Jason3S

Hi @Jason3S,

thank you for all your effort! May I ask you to have a look at the German dictionary, too?

To achieve utf8 compliance it is common in German to substitute umlauts:

  • Ä -> Ae
  • Ö -> Oe
  • Ü -> Ue
  • ä -> ae
  • ö -> oe
  • ü -> ue

Currently my solution is to add all words I encounter to a custom dictionary, but it would be great to have these substitutions build-in.

Cheers,
Arne

ar-std avatar Jun 13 '22 11:06 ar-std