languagetool
languagetool copied to clipboard
[en] a<->an
a->an False Positive
These sentences are already correct since those words start with a "you" sound:
He can be a holder of a usufruct, but this is a matter of contract, not law. Next, I will propose a eudaemonistic definition of distributive justice. This law is needed for a universalized exchange economy.
So:
- usufruct
- eudaemonistic
- universalized
should be added to an a/an exception list.
an->a Wrong Suggestion (American English)
In American English, the word "herb" has a silent 'h', so is spoken more like "erb":
This is an herb that causes catalepsy.
Currently, "a" is recommended.
In American English, "an" is correct. (Although it may depend on dialect (?).)
In British English, this rule can stay as is, since they use a hard 'h'.
Unsure of Australian/Canadian/Others.
a->an Completely Missed
"a SEC" -> "an SEC"
This will trigger a SEC investigation and eventual prosecution.
I also caught this today.
LT doesn't detect this in quotes:
a->an
Hand me a "s".
but it does detect no-quote version:
a->an (Correct)
Hand me a s.
LT correctly detects most vowel-sound alphabet letters with/without quotes too:
a->an (Correct)
Hand me a "a". Hand me a "e". Hand me a "f". Hand me a "i". Hand me a "l". Hand me a "m". Hand me a "n". Hand me a "o". Hand me a "r". Hand me a "x".
But LT is wrong on "h" and "u".
a->an (Wrong recommendation)
Hand me a "u".
'u' is spoken like "you".
a->an (Undetected)
Hand me a "h".
"h" is an odd one though.
- In American English, 'h' is like you're saying "eight" with a "ch" on the end: "aitch"
- In British/Australian, I believe many accents pronounce an 'h' like: "haitch"
- See Youtube video: "Say letter H - #aitch vs #haitch H pronunciation"
Fixing a/an issues...
Is it as simple as tossing the words into these 2 files?
- det_a.txt + det_an.txt
?
If so, I could create a pull request.
(And I might have a lot of other words to correct too.)
A few I recommended in #4352 could go in there.
A:
- eudaemonism
- eudaimonism
- eudaemonist
- eudaimonist
- eudaemonists
- eudaimonists
- eudaemonistic
- eudaimonistic
- usufruct
- usufructs
- usufructuary
- usufructuaries
- universalized
- universalised
(All of these start with a "you" sound.)
Pull request above adds 175 words from M-W.
I didn't add these 2 words, but they also start with a "You" sound (so would also have "A" before):
Iugurtha = alternate spelling of "Jugurtha" (who was a Numidian king) https://www.merriam-webster.com/dictionary/iugurtha https://en.wikipedia.org/wiki/Jugurtha
justitia omnibus = latin phrase ("justice for all") https://www.merriam-webster.com/dictionary/justitia%20omnibus
There's also the completely non-standard:
- uniquer
- https://english.stackexchange.com/questions/7441/is-uniquer-a-word
- https://en.wiktionary.org/wiki/uniquer
- uniquest
- https://www.urbandictionary.com/define.php?term=uniquest
Added 185 more words.
This comment exists in det_a:
this is for "union-ized"; the different lemma "un-ionized" requires "an"
This exception would apply to these words as well:
- unionisation
- unionise
- unionised
- unioniser
- unionization
- unionize
- unionizer
In the past year, I ran across some missed errors with:
- a/an + numbers (especially fractions)
Example:
an -> a
- [ ] Now it will only be an 1 cent difference.
- [ ] It’s probably about an 1/2 acre of grass.
- [ ] It’s probably about an 2 acre prairie.
This is a little more problematic, because it's dealing with spoken numbers:
- "an one cent" -> "a one cent"
- "an half acre" -> "a one-half acre" OR "a half acre"
- "an two" -> "a two"
There are ambiguous cases when spoken, like:
- [ ] Cut me a 1/8 slice of pie.
which can be:
- "an eighth" OR "a one eighth".
LT currently gets this correct:
a -> an
- You will get a 8.2% raise.
- "a eight" -> "an eight"
but I'm not sure what exact rule triggers it.
Note: Should LT deal with the Unicode "Vulgar Fractions" as well?
- ¼ = U+00BC = VULGAR FRACTION ONE QUARTER
- ½ = U+00BD = VULGAR FRACTION ONE HALF
- ¾ = U+00BE = VULGAR FRACTION THREE QUARTERS
- ⅐ = U+2150 = VULGAR FRACTION ONE SEVENTH
- ⅑ = U+2151 = VULGAR FRACTION ONE NINTH
- ⅒ = U+2152 = VULGAR FRACTION ONE TENTH
- ⅓ = U+2153 = VULGAR FRACTION ONE THIRD
- ⅔ = U+2154 = VULGAR FRACTION TWO THIRDS
- ⅕ = U+2155 = VULGAR FRACTION ONE FIFTH
- ⅗ = U+2157 = VULGAR FRACTION THREE FIFTHS
- ⅘ = U+2158 = VULGAR FRACTION FOUR FIFTHS
- ⅙ = U+2159 = VULGAR FRACTION ONE SIXTH
- ⅚ = U+215A = VULGAR FRACTION FIVE SIXTHS
- ⅛ = U+215B = VULGAR FRACTION ONE EIGHTH
- ⅜ = U+215C = VULGAR FRACTION THREE EIGHTHS
- ⅝ = U+215D = VULGAR FRACTION FIVE EIGHTHS
- ⅞ = U+215E = VULGAR FRACTION SEVEN EIGHTHS
- ↉ = U+2189 = VULGAR FRACTION ZERO THIRDS