crump icon indicating copy to clipboard operation
crump copied to clipboard

Many source files are improperly encoded

Open waldoj opened this issue 9 years ago • 0 comments

Corp.csv, LLC.csv, Name.History.csv, and Officer.csv all claim to be UTF-8, but contain invalid characters. (They're all people's names, and I guarantee you that well over 90% of those people are black or Latino. So.) They can be found via grep -axv '.*' filename.csv.

waldoj avatar Jul 26 '16 01:07 waldoj