parquet-cli
Command line (CLI) tool to inspect Apache Parquet files on the go
- This turns on two flags for the pandas table conversion to treat datetime/dates as objects which helps to avoid errors like: pyarrow.lib.ArrowInvalid: Casting from timestamp[us] to timestamp[ns] would result...
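For illustration, a minimal sketch of what such a conversion might look like. The snippet does not name the two flags; this assumes they are pyarrow's `date_as_object` and `timestamp_as_object` keyword arguments to `Table.to_pandas`. The helper name `table_to_pandas` is hypothetical and is not taken from the project.

```python
# Assumption: the "two flags" are these pyarrow Table.to_pandas keyword arguments.
OBJECT_DATE_FLAGS = {"date_as_object": True, "timestamp_as_object": True}


def table_to_pandas(table):
    """Convert a pyarrow.Table to a pandas DataFrame (hypothetical helper).

    Dates and timestamps are kept as Python objects instead of being cast
    to datetime64[ns], which avoids the out-of-range cast in the conversion.
    """
    return table.to_pandas(**OBJECT_DATE_FLAGS)
```

The trade-off is that object columns are slower and use more memory than native datetime columns, so this is best suited to inspection rather than heavy processing.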
My use case is mostly to cat from s3 and pipe that into your parq utility. This is what I did to accept stdin if someone else wants to work...
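The poster's actual change is cut off above, so the following is only a sketch of one way stdin support could work, not the code from the issue. Parquet stores its metadata in a footer at the end of the file, so a pipe has to be buffered into a seekable object before it can be read. The function names `buffer_stdin` and `read_table_from_stdin` are hypothetical; `pyarrow.parquet.read_table` is assumed to be available.

```python
import io
import sys


def buffer_stdin(stream=None):
    """Read all bytes from a binary stream into a seekable in-memory buffer."""
    stream = sys.stdin.buffer if stream is None else stream
    return io.BytesIO(stream.read())


def read_table_from_stdin(stream=None):
    """Load a Parquet table from piped input (hypothetical helper)."""
    # Imported lazily so the buffering helper works without pyarrow installed.
    import pyarrow.parquet as pq

    return pq.read_table(buffer_stdin(stream))
```

Because the whole input is held in memory, this approach only fits files that are small enough to buffer.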
I am trying to install the package but am getting the error below: H:\>pip install parquet-cli Collecting parquet-cli Retrying (Retry(total=4, connect=None, read=None, redirect=None)) after connection broken by 'ProtocolError('Connection aborted.', ConnectionResetError(10054, 'An existing...
Content of my command line while trying to get parquet-cli to run on Ubuntu 18.04.1 LTS: ``` machine:~$ pip install parquet-cli Collecting parquet-cli ... [installing stuff] Installing collected packages: pytz,...
https://github.com/chhantyal/parquet-cli/blob/588b738a3f1661cba62b11e6a1ac6fd6d709f0eb/parq/main.py#L62-L68 It's unintuitive to specify `True` or `False` after command line flags, which are traditionally used without further argument. I think you want the [`action='store_true'` behavior of `ArgumentParser`](https://docs.python.org/2/library/argparse.html#action). i.e. I...
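A minimal sketch of the `store_true` behavior the issue refers to. The parser below is illustrative only and is not the project's actual `main.py`; the `--schema` flag name is borrowed from the command shown in another issue, and `build_parser` is a hypothetical name.

```python
import argparse


def build_parser():
    """Build an illustrative parser with a boolean flag (not the project's real one)."""
    parser = argparse.ArgumentParser(prog="parq")
    parser.add_argument("file", help="path to a Parquet file")
    # store_true: the flag takes no value; present means True, absent means False.
    parser.add_argument("--schema", action="store_true", help="print the schema")
    return parser


with_flag = build_parser().parse_args(["calls.parquet", "--schema"])
without_flag = build_parser().parse_args(["calls.parquet"])
```

With this setup, `parq calls.parquet --schema` enables the option, and omitting the flag leaves it off, so no `True` or `False` argument is needed.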
Hi! Is there a workaround for the list datatypes? Here's the error I'm seeing: $ parq calls.parquet --schema Traceback (most recent call last): File "/home/clande/.local/bin/parq", line 10, in sys.exit(main()) File...
Bumps [pyarrow](https://github.com/apache/arrow) from 0.9.0.post1 to 14.0.1. [Dependabot compatibility score](https://docs.github.com/en/github/managing-security-vulnerabilities/about-dependabot-security-updates#about-compatibility-scores) Dependabot will resolve any conflicts with this PR as long as you don't alter...