Unishox2
Can this compression library be used when the input encoding is not UTF-8 or the character set is not Unicode?
Can this compression library be used when the input encoding is not UTF-8 (e.g., short binary data) or the character set is not Unicode (e.g., GB2312, Big5, Shift_JIS)?
Presently, the code has been written only for UTF-8. It could be implemented for UTF-16 and other encodings, which would benefit some languages, but the space savings may be small for languages such as Chinese, Japanese, and Korean because the number of characters is large.
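Since only UTF-8 input is handled at present, one possible workaround for text in a legacy encoding is to transcode it to UTF-8 before compressing. The sketch below is only an illustration under that assumption: it uses Python's built-in codecs, the helper name `to_utf8` and the sample strings are hypothetical, and it deliberately stops short of calling the compressor. It does not apply to arbitrary binary data, which has no text encoding to convert from.

```python
def to_utf8(raw: bytes, source_encoding: str) -> bytes:
    """Decode bytes from a legacy encoding and re-encode them as UTF-8.

    Hypothetical helper for illustration; raises UnicodeDecodeError if
    `raw` is not valid text in `source_encoding` (e.g. binary data).
    """
    return raw.decode(source_encoding).encode("utf-8")


# Sample inputs in the three encodings mentioned in the question.
samples = {
    "gb2312": "你好".encode("gb2312"),
    "big5": "你好".encode("big5"),
    "shift_jis": "こんにちは".encode("shift_jis"),
}

for encoding, raw in samples.items():
    utf8 = to_utf8(raw, encoding)
    # The UTF-8 bytes are what would then be handed to the compressor.
    print(f"{encoding}: {len(raw)} bytes -> {len(utf8)} bytes as UTF-8")
```

Note that CJK characters typically take two bytes in these legacy encodings and three bytes in UTF-8, so the transcoded input is larger before any compression is applied.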