luajson icon indicating copy to clipboard operation
luajson copied to clipboard

Add support for processing surrogate pairs in sequenced \uD800\uDC00 items

Open harningt opened this issue 14 years ago • 1 comments

Surrogate pairs as described in the title will be decomposed into a pair of 3-byte utf-8 encoded chunks that are technically invalid. These should be decomposed into a 4-byte utf-8 encoded value for the represented codepoint.

harningt avatar Feb 08 '11 04:02 harningt

+1, this would be awesome.

mtdowling avatar Nov 03 '14 05:11 mtdowling