github-typo-corpus
github-typo-corpus copied to clipboard
Getting just the spelling mistakes of words?
Hi,
The very first entry has a correction that simply changes ){
to ) {
(it inserts a space). For my application, I'd like to focus on typos of English words only. Do you have a suggested way to filter these out?
Your paper says that you annotated the data set with some classifications, such as "Spell" when the error was a spelling mistake. I think this would help me to do the filtration I need to. Is it possible to get these annotations?
Thank you