archive
archive copied to clipboard
Decompressed error, Bad UTF-8 encoding on Chinese
Bug on 2.0.10 with Chinese folder, the example file is attached. The error catched:
Bad UTF-8 encoding
The file is small, so I attach It here. rawMap.zip
I'm pretty sure It's the problem of Chinese decoding, because I deleted all the Chinese file and renamed the Chinese folder, and then run again, everything is working as expected.
Your zip file looks like GBK or GB2312 encoding.
As far as I know, 'archive' only support UTF-8. Use UTF-8 instead.
Maybe 'archive' could guess encoding like tool 'unar', or provide encoding choice like 'unzip-iconv': unzip -O gbk test.zip
? @brendan-duncan
@javanli, Just for clearification, Big5 is used in Traditional Chinese (zip Attachment), and Both GBK and GB2312 are for Simplified Chinese. In additional, It's hard to let user follow a rule strictly, hope there is a solution or workaround to slove this issuse instead of prohibition 😞
@Tokenyet My way is rewrite InputStream.readString
, try different codecs to decode the buffer.