Henri Sivonen

Results 108 issues of Henri Sivonen

ICU4X's data representation doesn't have a place for marking the first character of an expansion as a starter that combines backwards. To accommodate the Unicode 16 characters discussed in section...

Gecko implements most of its IDNA processing on its own, but it currently uses ICU4C's `uidna_labelToUnicode` for UTS 46 processing. Code that wraps `ComposingNormalizer::try_new_uts46_without_ignored_and_disallowed_unstable` and augments it to provide the...

C-collator
U-gecko

C++20 added [`std::weak_ordering`](https://en.cppreference.com/w/cpp/utility/compare/weak_ordering), which logically corresponds to Rust's `core::cmp::Ordering`. AFAICT, Rust's `code::cmp::Ordering` has the bit representation of `i8` `-1`, `0`, and `1` while `std::weak_ordering` in practice the C++ `int` representation...

The initial glue code for providing the canonical composition and canonical decomposition operation to HarfBuzz assumes that the incoming `hb_codepoint_t` values might not be valid scalar values. We should investigate...

A-performance
C-collator

### What is the issue with the URL Standard? UTS 46 revision 31 added a _IgnoreInvalidPunycode_ flag to its ToASCII and ToUnicode operations. The URL Standard should be explicit about...

topic: idna

### What is the issue with the URL Standard? https://url.spec.whatwg.org/#concept-domain-to-ascii has a _beStrict_ flag to validation. This flag affects the _UseSTD3ASCIIRules_ and _VerifyDnsLength_ flags for UTS 46 processing, but don't...

topic: idna
topic: validation

### What is the issue with the URL Standard? The URL Standard gives advice about URL rendering: https://url.spec.whatwg.org/#ref-for-concept-domain-to-unicode%E2%91%A0 It also in the https://url.spec.whatwg.org/#concept-host-parser section says: "Alternatively [UTF-8 decode without BOM...

topic: idna
editorial

### What is the issue with the URL Standard? When reading https://url.spec.whatwg.org/#concept-domain-to-ascii in isolation of https://url.spec.whatwg.org/#concept-host-parser (and without reading ICU4C's uts46.cpp first), it's not at all apparent that 1) STD3...

topic: idna
editorial

### What is the issue with the URL Standard? When reading https://url.spec.whatwg.org/#idna in isolation to find out what kind of IDNA library surface the URL Standard needs, it's not apparent...

topic: idna
editorial
topic: validation

### What is the issue with the URL Standard? https://url.spec.whatwg.org/#valid-domain could use an informative note that states the implications of the two-step (both ToASCII and ToUnicode) check. Given the how...

topic: idna
editorial