unicode-categories
unicode-categories copied to clipboard
A simple crate that allows for unicode categories can be queried for characters.
I think it makes sense to optimize for the ASCII ranges before falling back to the binary search for performance reasons. For example check out what go does: https://golang.org/src/unicode/letter.go?s=5421:5446#L182 ```...
When I check https://crates.io/crates/unicode_categories, I see version `0.1.1`, but it is not actually up to date with this repository. I discovered this when I observed out-of-date unicode data. I will...
In https://github.com/swgillespie/unicode-categories/blob/master/src/lib.rs#L182-L183: ``` || self.is_punctuation_close() || self.is_punctuation_close() ``` The second one is redundant and should be removed.
I noticed that the latest – and only – commit containing Unicode property tables was generated in October 2015 – which means that the latest standard this crate can support...
Hi, I would like to use this crate, but I need a function that returns the category from the crate. For example: ```rust let c = 'A'; assert_eq!(c.category(), "Lu"); ```...
Currently, I have a codebase which relies on `unicode-categories`, but mostly uses hard-coded character tables because it needs to match characters based on values in the "BIDI" field, such as...