languagetool icon indicating copy to clipboard operation
languagetool copied to clipboard

[en] a<->an

Open Tex2002ans opened this issue 4 years ago • 5 comments

a->an False Positive

These sentences are already correct since those words start with a "you" sound:

He can be a holder of a usufruct, but this is a matter of contract, not law. Next, I will propose a eudaemonistic definition of distributive justice. This law is needed for a universalized exchange economy.

So:

  • usufruct
  • eudaemonistic
  • universalized

should be added to an a/an exception list.


an->a Wrong Suggestion (American English)

In American English, the word "herb" has a silent 'h', so is spoken more like "erb":

This is an herb that causes catalepsy.

Currently, "a" is recommended.

In American English, "an" is correct. (Although it may depend on dialect (?).)

In British English, this rule can stay as is, since they use a hard 'h'.

Unsure of Australian/Canadian/Others.


a->an Completely Missed

"a SEC" -> "an SEC"

This will trigger a SEC investigation and eventual prosecution.

Tex2002ans avatar Feb 17 '21 06:02 Tex2002ans

I also caught this today.

LT doesn't detect this in quotes:

a->an

Hand me a "s".

but it does detect no-quote version:

a->an (Correct)

Hand me a s.

LT correctly detects most vowel-sound alphabet letters with/without quotes too:

a->an (Correct)

Hand me a "a". Hand me a "e". Hand me a "f". Hand me a "i". Hand me a "l". Hand me a "m". Hand me a "n". Hand me a "o". Hand me a "r". Hand me a "x".

But LT is wrong on "h" and "u".

a->an (Wrong recommendation)

Hand me a "u".

'u' is spoken like "you".

a->an (Undetected)

Hand me a "h".

"h" is an odd one though.

Tex2002ans avatar Feb 18 '21 09:02 Tex2002ans

Fixing a/an issues...

Is it as simple as tossing the words into these 2 files?

  • det_a.txt + det_an.txt

?

If so, I could create a pull request.

(And I might have a lot of other words to correct too.)


A few I recommended in #4352 could go in there.

A:

  • eudaemonism
  • eudaimonism
  • eudaemonist
  • eudaimonist
  • eudaemonists
  • eudaimonists
  • eudaemonistic
  • eudaimonistic
  • usufruct
  • usufructs
  • usufructuary
  • usufructuaries
  • universalized
  • universalised

(All of these start with a "you" sound.)

Tex2002ans avatar Feb 22 '21 10:02 Tex2002ans

Pull request above adds 175 words from M-W.

I didn't add these 2 words, but they also start with a "You" sound (so would also have "A" before):

Iugurtha = alternate spelling of "Jugurtha" (who was a Numidian king) https://www.merriam-webster.com/dictionary/iugurtha https://en.wikipedia.org/wiki/Jugurtha

justitia omnibus = latin phrase ("justice for all") https://www.merriam-webster.com/dictionary/justitia%20omnibus

There's also the completely non-standard:

  • uniquer
    • https://english.stackexchange.com/questions/7441/is-uniquer-a-word
    • https://en.wiktionary.org/wiki/uniquer
  • uniquest
    • https://www.urbandictionary.com/define.php?term=uniquest

Tex2002ans avatar Mar 06 '21 22:03 Tex2002ans

Added 185 more words.

This comment exists in det_a:

this is for "union-ized"; the different lemma "un-ionized" requires "an"

This exception would apply to these words as well:

  • unionisation
  • unionise
  • unionised
  • unioniser
  • unionization
  • unionize
  • unionizer

Tex2002ans avatar Mar 10 '21 16:03 Tex2002ans

In the past year, I ran across some missed errors with:

  • a/an + numbers (especially fractions)

Example:

an -> a

  • [ ] Now it will only be an 1 cent difference.
  • [ ] It’s probably about an 1/2 acre of grass.
  • [ ] It’s probably about an 2 acre prairie.

This is a little more problematic, because it's dealing with spoken numbers:

  • "an one cent" -> "a one cent"
  • "an half acre" -> "a one-half acre" OR "a half acre"
  • "an two" -> "a two"

There are ambiguous cases when spoken, like:

  • [ ] Cut me a 1/8 slice of pie.

which can be:

  • "an eighth" OR "a one eighth".

LT currently gets this correct:

a -> an

  • You will get a 8.2% raise.
    • "a eight" -> "an eight"

but I'm not sure what exact rule triggers it.


Note: Should LT deal with the Unicode "Vulgar Fractions" as well?

  • ¼ = U+00BC = VULGAR FRACTION ONE QUARTER
  • ½ = U+00BD = VULGAR FRACTION ONE HALF
  • ¾ = U+00BE = VULGAR FRACTION THREE QUARTERS
  • ⅐ = U+2150 = VULGAR FRACTION ONE SEVENTH
  • ⅑ = U+2151 = VULGAR FRACTION ONE NINTH
  • ⅒ = U+2152 = VULGAR FRACTION ONE TENTH
  • ⅓ = U+2153 = VULGAR FRACTION ONE THIRD
  • ⅔ = U+2154 = VULGAR FRACTION TWO THIRDS
  • ⅕ = U+2155 = VULGAR FRACTION ONE FIFTH
  • ⅗ = U+2157 = VULGAR FRACTION THREE FIFTHS
  • ⅘ = U+2158 = VULGAR FRACTION FOUR FIFTHS
  • ⅙ = U+2159 = VULGAR FRACTION ONE SIXTH
  • ⅚ = U+215A = VULGAR FRACTION FIVE SIXTHS
  • ⅛ = U+215B = VULGAR FRACTION ONE EIGHTH
  • ⅜ = U+215C = VULGAR FRACTION THREE EIGHTHS
  • ⅝ = U+215D = VULGAR FRACTION FIVE EIGHTHS
  • ⅞ = U+215E = VULGAR FRACTION SEVEN EIGHTHS
  • ↉ = U+2189 = VULGAR FRACTION ZERO THIRDS

Tex2002ans avatar Jun 25 '22 09:06 Tex2002ans