encoding icon indicating copy to clipboard operation
encoding copied to clipboard

Reflect changes in GB 18030-2022

Open achristensen07 opened this issue 2 years ago • 5 comments

https://encoding.spec.whatwg.org/index-gb18030.txt contains 18 code points that have been changed by GB 18030-2022. We should probably update.

achristensen07 avatar Jun 06 '23 23:06 achristensen07

Related: #27 #57. It's not clear whether web browsers want to update these, or leave legacy encodings forever-unchanged.

domenic avatar Jun 07 '23 01:06 domenic

We, at least, definitely want to update (in fact, we already did). The definition of this encoding has been updated upstream. There needs to be a consistent behavior between the web browser on a platform and native apps on the platform. As the adage goes, "the future is longer than the past" and there will be more content produced with the new mappings than there is existing content. We can't just close our eyes and hope that all authors use UTF-8, especially when there are laws requiring that ~all products sold in certain places must conform.

litherum avatar Jun 14 '23 21:06 litherum

@litherum Did WebKit implement Unicode Technical Committee recommendation on this topic?

hsivonen avatar Jun 15 '23 06:06 hsivonen

Yes.

litherum avatar Jun 16 '23 00:06 litherum

We ended up moving away from that recommendation and going with the exact GB 18030-2022 mappings instead.

achristensen07 avatar Oct 10 '23 21:10 achristensen07