UTF-unknown
UTF-unknown copied to clipboard
UTF-8 file is detected as Windows-1252 (western) (SBCSCodePageEncoding)
Input: Text.txt Text: "ND Driver’s License DOE111111"
Output encoding: System.Text.SBCSCodePageEncoding Expected encoding: UTF8
related https://github.com/CharsetDetector/UTF-unknown/issues/168
details
tested it, 1252 is indead wrong