Jaume Ortolà
Jaume Ortolà
The issues with soft hyphens are solved with annotated text: https://github.com/languagetool-org/languagetool/pull/6932/commits/403b70f1a3ed2f1de1924fe00f6ceea06bac2363 The combining characters are still marked as spelling errors by LanguageTool. But they don't seem to be a frequent...
The solution for `busca-te` is good and general. I also added a fix for `nada` and `dança` (https://github.com/languagetool-org/languagetool/commit/4eb8f2d1e7e25f6802609d385ebcb059496733f8), but they are more ambiguous words and probably the solution is not...
I know that these words are ambiguous, and it can be difficult to fully disambiguate them. * "o" can be determiner and pronoun (but not preposition!) * "a" can be...
TODO: remove duplicates. For example, in Dutch: ``` old: Thailand, thailand new: Thailand, Thailand ```
There is a global disambiguation rule that should add the POS tags `_IS_URL`. But there is a syntax error in the rule, and the tags are not added. Possible solutions:...
https://github.com/languagetool-org/languagetool/commit/94b3f3f5d7dfc20db5be2524327cc2503999faba
@susanaboatto Could you send me some sentence where "provámos" is synthesized as a suggestion in this or any other rule? If the accent is in the original sentence, it seems...
I wrote this new rule. Please check the message and the rule name. @susanaboatto https://github.com/languagetool-org/languagetool/commit/13048ad8638971d116ab6d374b1825af67e5dfef It is not indispensable because the Brazilian speller already detects the error, but it can...
I don't fully understand what are you asking for. One POS tag for two tokens is not possible. What we do in other languages (French, Catalan) is to tokenize this...
The solution suggested by Marco is good only for rule messages, but not if we want to adjust other messages (e.g. rule names). It should not be difficult to create...