framework
framework copied to clipboard
Ability to skip chars that breaks encoding?
Overview
Currently, the parser will stop once it hits a bad encoding exception. It happens because of the Python streaming model and, basically, the impossibility of skipping just not-recognized chars instead of the whole data chunks as we stream data.
This issue is to investigate the ability to handle files with broken encoding.