--help should document options to --encoding
I just ran into an issue with csvjoin 1.0.5 where a file was failing to parse with this error:
UnicodeDecodeError: 'utf-8' codec can't decode byte 0xc1 in position 377: invalid start byte
It didn't like Á.
In the end the solution was --encoding iso-8859-1, but it was really frustrating that the help said there was an encoding flag and said nothing about how to use it.
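For anyone hitting the same error, here is a minimal Python sketch of what is probably going on. It assumes the file was saved as ISO-8859-1, where Á is the single byte 0xc1; the variable names are only for illustration and this is not csvkit's own code.

```python
# Assumption: the input was saved as ISO-8859-1, where "Á" is the byte 0xc1.
data = "Á".encode("iso-8859-1")

# Decoding that byte as UTF-8 fails, because 0xc1 cannot start a UTF-8 sequence.
try:
    data.decode("utf-8")
    error = None
except UnicodeDecodeError as exc:
    error = str(exc)

# Decoding with the encoding the file was actually written in succeeds.
text = data.decode("iso-8859-1")

print(error)
print(text)
```

Passing --encoding iso-8859-1 makes the tool decode the input with that second codec rather than UTF-8.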
Right now the --help has:
-e ENCODING, --encoding ENCODING
Specify the encoding of the input CSV file.
What do you suggest it say?
Either have a list of formats in the help there, or add a way to list formats.
ffmpeg has -formats; convert does -list formats
I think having --encoding with no argument print the list would be most reasonable, but that's also not as easy as just putting the list in the help.
In the meantime, you can refer to the available encodings in Python here: https://docs.python.org/3/library/codecs.html#standard-encodings
Python apparently has no easy way to report the available encodings on a given platform.
I instead just added a link to those docs from https://csvkit.readthedocs.io/en/latest/common_arguments.html
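If someone wants an approximate list locally, the sketch below is one way to get it. It assumes the standard library's encodings.aliases table is a good enough starting point; that table is not guaranteed to be complete, and it includes a few non-text codecs (such as base64), so treat the output as a hint and not as an authoritative list. The helper name known_encodings is made up for this example.

```python
import codecs
from encodings.aliases import aliases


def known_encodings():
    """Return canonical names of codecs that can be looked up on this platform.

    Hypothetical helper: built from the stdlib alias table, so it may miss
    codecs that have no alias entry.
    """
    names = set()
    for codec_name in set(aliases.values()):
        try:
            # lookup() raises LookupError for codecs missing on this
            # platform (for example, Windows-only ones elsewhere).
            names.add(codecs.lookup(codec_name).name)
        except LookupError:
            continue
    return sorted(names)


if __name__ == "__main__":
    for name in known_encodings():
        print(name)
```

Any name it prints, or any alias from the Python docs page, should be accepted by --encoding, since both go through Python's codec lookup.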
Cool, thanks.