spaCy
Character-based orthographic variants
Feature description
Similar to the existing token-based orthographic variants, it would be useful to add data augmentation options for character-based orthographic variants. Examples are the Romanian variants discussed in #4736 and German ß, which can also be written as ss.
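To make the request concrete, here is a minimal sketch of what character-level augmentation could look like. It is plain Python and not an existing spaCy API: the names `CHAR_VARIANTS`, `augment_char_variants`, and `level` are hypothetical, and the Romanian entries assume the variants in question are the comma-below and cedilla forms of ș and ț.

```python
import random

# Hypothetical variant table (an assumption for illustration):
# each character maps to a list of alternative spellings.
CHAR_VARIANTS = {
    "ß": ["ss"],             # German eszett written as double s
    "\u0219": ["\u015f"],    # s with comma below -> s with cedilla
    "\u021b": ["\u0163"],    # t with comma below -> t with cedilla
}


def augment_char_variants(text, variants=CHAR_VARIANTS, level=0.5, rng=None):
    """Replace each character that has a variant with probability `level`."""
    rng = rng or random.Random()
    out = []
    for ch in text:
        options = variants.get(ch)
        # rng.random() is in [0.0, 1.0), so level=1.0 always replaces
        # and level=0.0 never does.
        if options and rng.random() < level:
            out.append(rng.choice(options))
        else:
            out.append(ch)
    return "".join(out)


# With level=1.0 every known character is replaced.
print(augment_char_variants("Straße", level=1.0))
```

Unlike token-based variants, a replacement such as ß to ss changes the length of the text, so any character offsets in the annotations would need to be adjusted alongside the text.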