proselint
proselint copied to clipboard
Extract rules from "LanguageTool" style checker
https://languagetool.org/
https://github.com/languagetool-org/languagetool/tree/master/languagetool-language-modules/en/src/main/java/org/languagetool/rules/en
Bingo: http://community.languagetool.org/rule/list?lang=en
Holy shit.
Running LanguageTool on a random 20,000 article subset of the English Wikipedia led to 37,000 errors being detected. However, many of these errors are false alarms, either because of problems with the Wikipedia syntax or because the LanguageTool error patterns are too strict. So we manually looked at 200 of the errors, finding that 29 of the 200 errors were real errors.
Their false alarm rate is 85%.
https://github.com/languagetool-org/languagetool/blob/master/languagetool-language-modules/en/src/main/resources/org/languagetool/rules/en/grammar.xml