languagetool
[fr] false alarm with domain in CONF_N_V
J'utilise google.com
@jaumeortola any idea how we can immunize or better tokenize domains?
There is a global disambiguation rule that should add the POS tag _IS_URL, but the rule contains a syntax error, so the tag is never added.
Possible solutions:
1. Add the tags in global disambiguation, and add exceptions in the rules that need them (<exception postag="_IS_URL"/>).
2. Fully immunize the domain name. Currently it is only marked "ignore_spelling".
3. Replace all the existing tags with _IS_URL.
For now, I have implemented (1) for this French sentence. (2) and (3) are more general solutions, but they could have undesired side effects.
https://github.com/languagetool-org/languagetool/commit/94b3f3f5d7dfc20db5be2524327cc2503999faba
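The commit is the authoritative change. As a rough illustration only, an exception of the kind described in (1) might look like the fragment below. The surrounding token and its postag value are placeholders assumed for the sketch, not the actual contents of CONF_N_V; only the <exception postag="_IS_URL"/> element comes from this thread.

```xml
<!-- Hypothetical sketch: the token and its postag value are assumed for illustration -->
<token postag="N.*" postag_regexp="yes">
  <!-- Skip tokens that global disambiguation has tagged as part of a URL or domain -->
  <exception postag="_IS_URL"/>
</token>
```

With an exception like this, a token such as google.com in "J'utilise google.com" would no longer match the pattern, provided the _IS_URL tag is actually added upstream.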
Thanks so much!