Henri Sivonen issues

Results: 80 issues of Henri Sivonen

Debuggable Macro Expansions

Proposal (previously an [RFC](https://github.com/rust-lang/rfcs/pull/2117)). Summary: By default, annotate code expanded from a macro with debug location info corresponding to the macro definition (i.e. the behavior that's currently available...

T-compiler

final-comment-period

major-change

Convert sorted data arrays to the Eytzinger order

The [Eytzinger order](https://arxiv.org/abs/1509.05053) improved things for Gecko's HTML parser. Data tables that are currently only searched by binary search should probably be converted to the Eytzinger order.

enhancement
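For readers unfamiliar with the layout named in the entry above, here is a minimal sketch of the Eytzinger order, assuming a 0-based array where the children of index `i` sit at `2*i + 1` and `2*i + 2`. The function names are hypothetical and are not taken from Gecko or any crate. The search shown is the plain branching form; the linked paper also discusses branch-free and prefetching variants that are not reproduced here.

```rust
use std::cmp::Ordering;

/// Copies a sorted slice into Eytzinger (breadth-first) order.
/// Hypothetical helper for illustration only.
fn eytzinger_from_sorted<T: Copy>(sorted: &[T]) -> Vec<T> {
    // An in-order walk of the implicit tree visits nodes in ascending
    // key order, so we hand out sorted elements one by one as we walk.
    fn fill<T: Copy>(sorted: &[T], out: &mut [T], next: &mut usize, node: usize) {
        if node >= out.len() {
            return;
        }
        fill(sorted, out, next, 2 * node + 1); // left subtree: smaller keys
        out[node] = sorted[*next];
        *next += 1;
        fill(sorted, out, next, 2 * node + 2); // right subtree: larger keys
    }

    // Start from a copy so every slot holds a valid value before it is overwritten.
    let mut out = sorted.to_vec();
    let mut next = 0;
    fill(sorted, &mut out, &mut next, 0);
    out
}

/// Searches an Eytzinger-ordered slice and returns the index of `key`
/// within that slice, or `None` if the key is absent.
fn eytzinger_search<T: Ord>(tree: &[T], key: &T) -> Option<usize> {
    let mut i = 0;
    while i < tree.len() {
        match key.cmp(&tree[i]) {
            Ordering::Equal => return Some(i),
            Ordering::Less => i = 2 * i + 1,
            Ordering::Greater => i = 2 * i + 2,
        }
    }
    None
}
```

With the sorted input `[1, 2, 3, 4, 5, 6, 7]`, the converted table is `[4, 2, 6, 1, 3, 5, 7]`: the root comes first and each level of the implicit tree is stored contiguously, which is what makes the early steps of a lookup cache-friendly.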

What emails are stored locally and "owned" by Mailpile is hard to grok

I disabled an IMAP mail source and changed its setting so that it couldn't connect to the server it previously connected to. To my dismay, this made all the labels...

Back End

Make Swedish default collation handling able to deal with CLDR 41 and with CLDR 42 or later

Swedish has a naming oddity that's [hard-coded](https://github.com/unicode-org/icu4x/blob/main/provider/datagen/src/transform/icuexport/collator/mod.rs#L63-L82) in datagen. [CLDR is fixing the oddity.](https://unicode-org.atlassian.net/browse/CLDR-15603) We should have something better than the current hard-coding so that ICU4X 1.0 datagen will be...

C-data-infra

C-collator

Fallback to "search" for non-matching collation type starting with "search"

Section [3.1.1 Collation Type Fallback](https://unicode.org/reports/tr35/tr35-collation.html#311-collation-type-fallback) in LDML Collation step 4 says: > If it does not exist, and the type starts with "search" but is longer, then set the type...

T-bug

C-data-infra

S-small
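The fallback rule quoted in the entry above can be sketched as a small predicate. This is an illustration only, assuming (from the issue title) that the fallback target is the plain `search` type. The function name and the `exists` callback are hypothetical stand-ins for whatever lookup the data provider actually offers, and the remaining steps of the LDML fallback chain are deliberately not modeled.

```rust
/// Returns `Some("search")` when the requested collation type does not exist,
/// starts with "search", and is longer than "search"; otherwise `None`,
/// meaning this particular rule does not apply.
/// `exists` is a hypothetical callback reporting whether data is available
/// for a given collation type.
fn search_fallback(requested: &str, exists: impl Fn(&str) -> bool) -> Option<&'static str> {
    const SEARCH: &str = "search";
    if !exists(requested)
        && requested.len() > SEARCH.len()
        && requested.starts_with(SEARCH)
    {
        Some(SEARCH)
    } else {
        None
    }
}
```

For example, if only `standard` and `search` data exist for a locale, a request for `searchjl` would resolve to `search` under this rule, while a request for `search` itself or for `standard` would be left untouched.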

Ensure that collations that use the import mechanism instead of alias are properly deduplicated

#1965 is about the CLDR collation alias mechanism. Some collations are logically aliases but are implemented as duplicated data via import. Ensure that these work and the data is deduplicated:...

C-data-infra

Ensure that the provider performs correct non-Chinese collation alias mapping

Ensure that the provider performs these alias mappings from CLDR for collations (Traditional Chinese and Norwegian have more specific issues): pa_IN: pa_Guru_IN sr_RS: sr_Cyrl_RS ars: ar_SA in_ID: id_ID iw:...

C-data-infra

Ensure that the provider performs correct alias mapping for Traditional Chinese locales

Ensure that if a specific (and existing) collation hasn't been specified with `-u-co-`, the following map to `zh-u-co-stroke`: * `zh-Hant` regardless of region. * `zh` without `Hans` but with any...

C-data-infra

S-epic

U-ecma402

Remove the removal of irrelevant locale extensions in the collator once the provider does it

Thought: After this is merged, we can take care of the "remove irrelevant extensions" piece within the data provider vertical fallback. _Originally posted by @sffc in https://github.com/unicode-org/icu4x/pull/1706#discussion_r876486611_

C-data-infra

S-small

C-collator

Provide a trie-based alternative to UnicodeSet

The ICU4X composing normalizer uses a `UnicodeSet` for a fast-path pass-through check, while the ICU4C composing normalizer uses a code point trie lookup. ICU4C ends up being faster even after...

T-enhancement

help wanted

backlog

A-performance

C-unicode