Unishox2 icon indicating copy to clipboard operation
Unishox2 copied to clipboard

Can this compression library be used when the input encoding is not UTF8 or the character set is not Unicode?

Open qnxsgwy opened this issue 2 years ago • 1 comments

Can this compression library be used when the input encoding is not UTF8(eg. short binary data) or the character set is not Unicode(eg. gb2312 big5 Shift_JIS)?

qnxsgwy avatar Feb 03 '23 01:02 qnxsgwy

Presently the code has been written only for UTF-8. It can be implemented for UTF-16 and other encoding methods. It will benefit some of the languages, but there may not be much savings in space for languages such as Chinese, Japanese and Korean as the number of characters are high.

siara-cc avatar Feb 03 '23 08:02 siara-cc