CSV parser chokes on UTF-8 BOM (byte order mark)

Open ghost opened this issue 4 years ago • 0 comments

2019-07-18 Claude Marinier

Look for and remove a UTF-8 BOM (byte order mark), a special three-byte sequence at the start of the file. If left in place, it becomes part of the first item, so the program cannot find a name heading in the first position.

The BOM is the byte sequence 0xEF, 0xBB, 0xBF.
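To illustrate the idea, here is a minimal sketch in Python rather than newLISP. It is not the patch from the attached diff. The helper names `strip_utf8_bom` and `read_header` and the sample data are hypothetical and chosen only for illustration. The sketch checks whether the input starts with the three BOM bytes and drops them before the header row is parsed.

```python
import csv
import io
from typing import List

# The three bytes of the UTF-8 byte order mark.
UTF8_BOM = b"\xef\xbb\xbf"


def strip_utf8_bom(data: bytes) -> bytes:
    """Return data without a leading UTF-8 BOM, if one is present."""
    if data.startswith(UTF8_BOM):
        return data[len(UTF8_BOM):]
    return data


def read_header(data: bytes) -> List[str]:
    """Parse the first CSV row after removing any leading BOM (hypothetical helper)."""
    text = strip_utf8_bom(data).decode("utf-8")
    return next(csv.reader(io.StringIO(text)))


# Made-up sample: a BOM followed by a heading row and one data row.
sample = UTF8_BOM + b"name,serial\nlaptop,123\n"
header = read_header(sample)

# Without stripping, the BOM decodes to U+FEFF and sticks to the first heading.
raw_header = next(csv.reader(io.StringIO(sample.decode("utf-8"))))
```

With the BOM removed, the first heading compares equal to `name`. Without it, the first heading carries an invisible leading U+FEFF character, which is why a lookup for a name heading in the first position fails.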

Discovered this while processing a database dump from an asset management system.

P.S. Sorry about the late posting and the mangled diff file.

csv-1.2.1-diff.txt

ghost avatar Aug 19 '20 15:08 ghost