cChardet icon indicating copy to clipboard operation
cChardet copied to clipboard

LookupError: unknown encoding: EUC-TW

Open meshy opened this issue 9 years ago • 3 comments

This seems similar in nature to https://github.com/PyYoshi/cChardet/issues/8, but unfortunately, I do not know what to recommend as an alternative to EUC-TW.

One can see that there is nothing that quite matches in Python's list of standard encodings.

I also thought that I should look through the other encodings mentioned in the readme, and found that there are a number of other codecs that did not come up in the list:

Do you have any recommendations for how I could decode strings that are detected as these types in python?

meshy avatar Oct 08 '15 22:10 meshy

Further investigation has revealed that python wont fix EUC-TW and ISO-2022-CN encodings.

meshy avatar Oct 09 '16 08:10 meshy

Just ran into this myself.

tzeppy avatar Feb 08 '19 20:02 tzeppy

  • X-ISO-10646-UCS-4-2143

  • X-ISO-10646-UCS-4-3412

See https://stackoverflow.com/q/18518730

rstm-sf avatar Nov 25 '19 18:11 rstm-sf