Henri Sivonen

Results 108 issues of Henri Sivonen

I have [out-of-tree normalization benchmarks](https://github.com/hsivonen/norm_bench) that use Wikipedia content and that take a rather long time to run. So far, experience suggests that English and Greek normalization performance are particularly...

C-test-infra
C-collator

`CollationElements::upcoming` currently holds `CharacterAndClassAndTrieValue`. Logically its last element (when not at the end of the stream) needs to hold `CharacterAndTrieValue` (which currently doesn't exist in the collator but exists in...

A-performance
T-techdebt
S-medium
C-collator

#2378 uses a pattern where there is a full slice and another slice that's known to be its suffix, and the prefix of the full slice is computed either so...

A-performance
S-small
C-collator

[Instead of pushing in a loop, use extend.](https://github.com/unicode-org/icu4x/pull/2481#discussion_r958400225)

good first issue
S-tiny
C-collator

[`icu_normalizer::properties`](https://unicode-org.github.io/icu4x-docs/doc/icu_normalizer/properties/index.html) has API surface whose purpose is to cater to HarfBuzz. Additionally, `icu_properties` has functionality required by HarfBuzz. [`hb-unicode.h`](https://searchfox.org/mozilla-central/source/gfx/harfbuzz/src/hb-unicode.h) defines the callbacks that need implementations. These are C-linkage function pointers,...

C-unicode
S-small
C-collator

Add const-evaluable `CharULE::from_array(chars: [char; N]) -> [CharULE; N]`. (I'm not sure if const evaluation is powerful enough to do this.)

discuss

(It was previously concluded in a meeting that it's not a post-1.0 breaking change to switch from bogus data causing GIGO to bogus data causing a constructor to err out.)...

discuss
C-collator

AFAICT https://github.com/web-platform-tests/wpt/blob/ab53b6abd671656733e41ef03cd0be11250688d7/content-security-policy/inheritance/history-iframe.sub.html#L53,L63 and https://github.com/web-platform-tests/wpt/blob/aa5400a53c5d9c90cbb16d5f2eb25c4c98801dae/content-security-policy/inheritance/window-open-local-after-network-scheme.sub.html#L51-L79 expect _non-initial_ about:blank to get its DOM built in the same task that creates the document object for the _non-initial_ about:blank. Since this appears to be...

[`getDefaultMIMECharset()`](https://github.com/eclipse-ee4j/javamail/blob/beab9265aa39a4c3c4a2e220b1e4c198060803a5/mail/src/main/java/javax/mail/internet/MimeUtility.java#L1316) tries to use a charset from a system property and falls back on the system encoding. It would make more sense to use UTF-8 unconditionally. MUAs have been able...

Consider providing functionality (with `std`, not with `no_std`) for retrieving the user's system-level preference list of languages as ICU4X locales. On Windows, Gecko prefers https://learn.microsoft.com/en-us/uwp/api/windows.system.userprofile.globalizationpreferences.languages?view=winrt-22621#windows-system-userprofile-globalizationpreferences-languages and adds region with likely...

good first issue
C-locale
S-large
U-ecma402