regex
regex copied to clipboard
Regex.Options.unicode
Usefull for example: "čočka".match(/\p{Letter}+/u)
https://developer.mozilla.org/en-US/docs/Web/JavaScript/Guide/Regular_Expressions/Unicode_Property_Escapes
This is also relevant to the fact that Char.isLower only works for ASCII, a regex-based alternative could make use of those classes
This is also relevant for sanely matching/stripping/replacing Emoji in Elm, with matchers like \p{Emoji}, \p{Emoji_Modifier}, \p{Emoji_Component}, \p{Emoji_Modifier_Base}, \p{Emoji_Presentation}.