csv2parquet
csv2parquet copied to clipboard
Convert a CSV to a parquet file.
Bumps [codecov](https://github.com/codecov/codecov-python) from 2.0.15 to 2.0.16. Changelog Sourced from codecov's changelog. 2.0.16 fixed reported command injection vulnerability. Commits 3a8b06b Version 2.0.16 b2951c0 Merge pull request #231 from codecov/ce-1380 2a80aa4 CE-1380_sanitize_args...
Bumps [numpy](https://github.com/numpy/numpy) from 1.17.4 to 1.22.0. Release notes Sourced from numpy's releases. v1.22.0 NumPy 1.22.0 Release Notes NumPy 1.22.0 is a big release featuring the work of 153 contributors spread...
It's possible in pyarrow to specify the compression level. This CLI should support the option to set compression level manually.
I like to stream a .tsv in from stdin through `/dev/stdin` but that means the automatic delim detection doesn't work. Delimiter should be a cli option.
If I read in a file that's a few GB big, it crashes my computer with 16GB because it swaps like crazy. The whole input file should never be in...
Hi, Is it possible to run csv2parquet in a way that means it appends to an existing parquet file? I have a large number of CSV file with a fixed...
I met this issue, TypeError: Cannot convert pyarrow.lib.ChunkedArray to pyarrow.lib.Array Could I get any helps, please?
Bumps [urllib3](https://github.com/urllib3/urllib3) from 1.25.7 to 1.26.5. Release notes Sourced from urllib3's releases. 1.26.5 :warning: IMPORTANT: urllib3 v2.0 will drop support for Python 2: Read more in the v2.0 Roadmap Fixed...
Bumps [py](https://github.com/pytest-dev/py) from 1.8.0 to 1.10.0. Changelog Sourced from py's changelog. 1.10.0 (2020-12-12) Fix a regular expression DoS vulnerability in the py.path.svnwc SVN blame functionality (CVE-2020-29651) Update vendored apipkg: 1.4...
Dealing with a 69M row CSV that has a column that I've set as a float. Occasionally there are rows with empty values which was throwing a ValueError. This PR...