icu4x
icu4x copied to clipboard
Solving i18n for client-side and resource-constrained environments.
@younies had assumed that `SinglePlaceholderPattern` required exactly one placeholder in the pattern, when in fact it supports _zero or one_. Given that this is primarily an internal-facing type and that...
Spawned out of https://github.com/unicode-org/icu4x/issues/3575 Some users may want to solely run rule based segmentation, perhaps if they are on low-resource systems. Given the conclusion in https://github.com/unicode-org/icu4x/issues/3575 that the dictionary/lstm keys...
Spinning out of #4996 In the initial PR I'm introducing closed enums for keys such as `Collation`. The motivator is that we only need key variants for values we have...
[WB3c](https://www.unicode.org/reports/tr29/#WB3c) and [WB3c](https://www.unicode.org/reports/tr29/#WB3c) interact in the same way [LB8a](https://www.unicode.org/reports/tr14/#LB8a) and [LB9](https://www.unicode.org/reports/tr14/#LB9) do. A correct implementation of that would require either duplicating every state as in https://github.com/unicode-org/icu4x/pull/4389, or hoisting the two...
Currently the rule-based break iteration rules are scattered around several places, all loosely derived from one another. This makes it difficult to update and causes breakages based on the inconsistencies...
The attributes no longer show up in datagen output. It is useful when they do.
CLDR and ICU4C both use July 15, 622 CE as the start of the epoch for the Islamic observational calendar: - https://github.com/unicode-org/cldr/blob/c1dc8c7ef6584668345cf741e51b1722d8114bc8/common/supplemental/supplementalData.xml#L4682-L4711 - https://github.com/unicode-org/icu/blob/3235c38f24711a86d7fc5adf5e536bf6a4f1d10a/icu4c/source/i18n/islamcal.h#L33-L36 Whereas ICU4X uses July 16, 622...
We've been using [Workload Identity Federation](https://cloud.google.com/iam/docs/workload-identity-federation) to access GCP resources from the main repository; it has been working smoothly, and this is the recommended approach by Google for GitHub Actions...
For characters that are their own decomposition, the least significant bit signifies "can combine backwards". As of Unicode 16, this information is also needed for complex decompositions, but the same...