json2parquet
Add type conversion when given a schema
People do weird stuff in JSON. A lot of systems try to figure out the intended types on their own: Redshift, for example, will convert strings to INTs for you. PyArrow purposely avoids unexpected conversions and would rather fail. When we are given a schema, we should support forcing JSON data into the proper types before ingesting it with PyArrow.
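A rough sketch of what that coercion step could look like, run on each JSON record before it is handed to PyArrow. Everything named here is an assumption for illustration: `coerce_record`, the `CONVERTERS` table, and the plain dict standing in for the schema are hypothetical, and a real implementation would presumably derive the type names from the fields of the `pyarrow.Schema` it is given.

```python
from datetime import datetime


def _to_bool(value):
    # Treat common string spellings explicitly; bool("false") would be True.
    if isinstance(value, str):
        text = value.strip().lower()
        if text in ("true", "1"):
            return True
        if text in ("false", "0"):
            return False
        raise ValueError(f"not a boolean: {value!r}")
    return bool(value)


def _to_timestamp(value):
    # Accept datetimes as-is, otherwise parse an ISO 8601 string.
    if isinstance(value, datetime):
        return value
    return datetime.fromisoformat(value)


# Hypothetical mapping from a schema type name to a Python converter.
CONVERTERS = {
    "int64": int,
    "float64": float,
    "string": str,
    "bool": _to_bool,
    "timestamp": _to_timestamp,
}


def coerce_record(record, schema):
    """Return a copy of `record` with each field forced to its schema type.

    `schema` maps field name -> type name (a key of CONVERTERS).
    Missing fields and None values become None; values that cannot be
    converted raise ValueError instead of being passed through silently.
    """
    coerced = {}
    for name, type_name in schema.items():
        value = record.get(name)
        if value is None:
            coerced[name] = None
            continue
        try:
            coerced[name] = CONVERTERS[type_name](value)
        except (TypeError, ValueError) as exc:
            raise ValueError(
                f"cannot convert field {name!r}={value!r} to {type_name}"
            ) from exc
    return coerced


schema = {"id": "int64", "price": "float64", "active": "bool", "name": "string"}
print(coerce_record({"id": "42", "price": "3.5", "active": "false", "name": 7}, schema))
```

Failing loudly on values that cannot be converted keeps this in line with PyArrow's own preference for errors over guesses: the conversion only happens because the caller asked for it by supplying a schema.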