
[pt] Improved disambiguator: tags for punctuation

Open marcoagpinto opened this issue 2 years ago • 3 comments

Hello @jaumeortola

I wondered if you could extend the disambiguator a bit so that it detects quotation-mark tokens like »«"”‘‹›' and accepts these tags:

_QUOT_OPEN
_QUOT_CLOSE

And the same for punctuation: ,.!? …

It would accept these tags:

_PUNCT_COMMA
_PUNCT_PERIOD
_PUNCT_EXCLAMATION
_PUNCT_INTERROGATION
_PUNCT_PERIOD3
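To make the request concrete, here is a minimal Python sketch of the token-to-tag mapping being asked for. It is only an illustration: LanguageTool's disambiguator is driven by its own rule files, not by code like this. The function name `tags_for`, the opening/closing split of the quotation marks (« opens and » closes, as in Portuguese typography), the extra characters “ and ’, and the choice to give straight quotes both tags are all assumptions, not something stated in this issue.

```python
from typing import List

# Hypothetical mapping from punctuation tokens to the tag names proposed above.
PUNCT_TAGS = {
    ",": "_PUNCT_COMMA",
    ".": "_PUNCT_PERIOD",
    "!": "_PUNCT_EXCLAMATION",
    "?": "_PUNCT_INTERROGATION",
    "…": "_PUNCT_PERIOD3",
}

# Assumed split: « opens and » closes (Portuguese convention).
OPENING_QUOTES = {"«", "“", "‘", "‹"}
CLOSING_QUOTES = {"»", "”", "’", "›"}
# Straight quotes can open or close, so they get both tags in this sketch.
AMBIGUOUS_QUOTES = {'"', "'"}


def tags_for(token: str) -> List[str]:
    """Return the proposed tags for a single token, or [] if none apply."""
    if token in PUNCT_TAGS:
        return [PUNCT_TAGS[token]]
    if token in OPENING_QUOTES:
        return ["_QUOT_OPEN"]
    if token in CLOSING_QUOTES:
        return ["_QUOT_CLOSE"]
    if token in AMBIGUOUS_QUOTES:
        return ["_QUOT_OPEN", "_QUOT_CLOSE"]
    return []


if __name__ == "__main__":
    for tok in ["«", "Olá", ",", "mundo", "…", "»", '"']:
        print(repr(tok), tags_for(tok))
```

A real disambiguation rule would also need context to decide whether a straight quote opens or closes a quotation; the sketch leaves that open by returning both tags.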

Also, maybe something similar could be done with brackets.

Thank you!

marcoagpinto avatar Jun 04 '22 09:06 marcoagpinto

I have added these tags. See disambiguation rules: PUNCTUATION, PUNCT, TRES_PONTOS and QUOT.

But there are some duplications and inconsistencies in the tags. Perhaps we should fix these minor problems before using the tags in the rules. @marcoagpinto

jaumeortola avatar Sep 02 '22 11:09 jaumeortola

@jaumeortola Wasn't _QUOT already there?

marcoagpinto avatar Sep 02 '22 11:09 marcoagpinto

> Wasn't _QUOT already there?

It was there. I listed all the rule IDs, existing and new, that assign tags for punctuation.

jaumeortola avatar Sep 02 '22 11:09 jaumeortola