Shane F. Carr

Results 1236 comments of Shane F. Carr

Some initial numbers of postcard with different fallback modes. Number in parentheses is the point in the postcard file at which the sorted locale lookup VarZeroVec ends and the data...

Very initial estimates for the impact of ZeroTrie on the postcard locale lookup table size, based on the strings in the compiled_data files (not the same set of locales as...

I missed something in https://github.com/unicode-org/icu4x/issues/3865#issuecomment-1773976272. The lookup table is not only a VZV of locale strings; it is also a FZV of a mapping from the VZV index to the...

I implemented a ZeroTrie version of BlobSchema in #4207. Results for Gregorian, runtime fallback, and all locales: | Data Key | Postcard Size | |---|---| | datesymbols | 185248 |...

> `datetime/patterns//date/numeric@1` What goes into this patterns key? Do you mean a symbol key like `datetime/symbols//months/numeric@1`?

> icu_datetime will attempt to load day/numeric@ for the corresponding length and resolved locale The other problem with this is that fallback isn't free. We should do things like this...

Assigning to @eggrobin since you're already in the thick of sentence segmentation.

To be clear, we're talking about this data: https://github.com/unicode-org/cldr/blob/main/common/segments/en.xml ```xml L.P. Alt. Approx. E.G. O. Maj. Misc. ```

CC @makotokato If ICU4C has a trie, you could re-use it. Else, it's perfectly fine to build a trie in ICU4X datagen. You can use [zerotrie](https://unicode-org.github.io/icu4x/rustdoc/zerotrie/index.html)::[ZeroTriePerfectHash](https://unicode-org.github.io/icu4x/rustdoc/zerotrie/struct.ZeroTriePerfectHash.html#), for example.