Henri Sivonen
# Proposal

(Previously an [RFC](https://github.com/rust-lang/rfcs/pull/2117).)

## Summary

By default, annotate code expanded from a macro with debug location info corresponding to the macro definition (i.e. the behavior that's currently available...
The [Eytzinger order](https://arxiv.org/abs/1509.05053) improved things for Gecko's HTML parser. Data tables that are currently only searched by binary search should probably be converted to the Eytzinger order.
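As a rough illustration of what converting a sorted table to the Eytzinger order involves, here is a minimal sketch. It is not the code used in Gecko's HTML parser; the function names `eytzinger_layout` and `eytzinger_search` and the `u32` element type are assumptions made for this example. The table is stored as an implicit binary search tree in breadth-first order, with index 0 left unused so that the children of node `k` sit at `2k` and `2k + 1`.

```rust
/// Rearranges a sorted slice into the Eytzinger (breadth-first) layout.
/// Hypothetical helper for illustration; slot 0 is an unused placeholder.
fn eytzinger_layout(sorted: &[u32]) -> Vec<u32> {
    // An in-order walk of the implicit tree visits slots in ascending
    // key order, so we hand out the sorted items one by one.
    fn fill(sorted: &[u32], out: &mut [u32], next: &mut usize, k: usize) {
        if k < out.len() {
            fill(sorted, out, next, 2 * k); // left subtree
            out[k] = sorted[*next];
            *next += 1;
            fill(sorted, out, next, 2 * k + 1); // right subtree
        }
    }
    let mut out = vec![0; sorted.len() + 1];
    let mut next = 0;
    fill(sorted, &mut out, &mut next, 1);
    out
}

/// Searches an Eytzinger-ordered table and returns the slot index of
/// `needle`, if present. Hypothetical helper for illustration.
fn eytzinger_search(layout: &[u32], needle: u32) -> Option<usize> {
    let mut k = 1;
    while k < layout.len() {
        let v = layout[k];
        if v == needle {
            return Some(k);
        }
        // Go to the right child when the current key is too small,
        // otherwise to the left child.
        k = 2 * k + usize::from(v < needle);
    }
    None
}
```

The early-exit comparison keeps the sketch short; the layout itself is what matters, since the first few levels of the tree share cache lines no matter which key is searched for.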
I disabled an IMAP mail source and changed its setting so that it couldn't connect to the server it previously connected to. To my dismay, this made all the labels...
Swedish has a naming oddity that's [hard-coded](https://github.com/unicode-org/icu4x/blob/main/provider/datagen/src/transform/icuexport/collator/mod.rs#L63-L82) in datagen. [CLDR is fixing the oddity.](https://unicode-org.atlassian.net/browse/CLDR-15603) We should have something better than the current hard-coding so that ICU4X 1.0 datagen will be...
Section [3.1.1 Collation Type Fallback](https://unicode.org/reports/tr35/tr35-collation.html#311-collation-type-fallback) in LDML Collation step 4 says: > If it does not exist, and the type starts with "search" but is longer, then set the type...
#1965 is about the CLDR collation alias mechanism. Some collations are logically aliases but are implemented as duplicated data via import. Ensure that these work and the data is deduplicated:...
Ensure that the provider performs these alias mappings from CLDR for collations (Traditional Chinese and Norwegian have more specific issues): pa_IN: pa_Guru_IN sr_RS: sr_Cyrl_RS ars: ar_SA in_ID: id_ID iw:...
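One way to picture the alias step is as a lookup applied before the collation data is loaded. The sketch below is not the ICU4X provider API: the function name `resolve_collation_alias` is hypothetical, locales are handled as plain strings for brevity, and only the four complete pairs quoted above are included.

```rust
/// Maps a locale identifier to its alias target, or returns the input
/// unchanged when no alias applies. Hypothetical helper for illustration;
/// the table holds only the pairs listed in the issue text.
fn resolve_collation_alias(locale: &str) -> &str {
    match locale {
        "pa_IN" => "pa_Guru_IN",
        "sr_RS" => "sr_Cyrl_RS",
        "ars" => "ar_SA",
        "in_ID" => "id_ID",
        other => other,
    }
}
```

A real provider would presumably operate on parsed locale types rather than strings, and would take the alias table from CLDR data instead of hard-coding it.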
Ensure that if a specific (and existing) collation hasn't been specified with `-u-co-`, the following map to `zh-u-co-stroke`: * `zh-Hant` regardless of region. * `zh` without `Hans` but with any...
Thought: After this is merged, we can take care of the "remove irrelevant extensions" piece within the data provider vertical fallback. _Originally posted by @sffc in https://github.com/unicode-org/icu4x/pull/1706#discussion_r876486611_
The ICU4X composing normalizer uses a `UnicodeSet` for a fast-path pass-through check while the ICU4C composing normalizer uses a code point trie lookup. ICU4C ends up being faster even after...