csv2parquet
csv2parquet copied to clipboard
Create Parquet files from CSV
Hi, I am getting this error: File "csv2parquet.py", line 330, in drill = DrillInstallation() File "csv2parquet.py", line 203, in **init** assert reference_executable is not None AssertionError any help?
get this error: Traceback (most recent call last): File "./csv2parquet", line 330, in drill = DrillInstallation() File "./csv2parquet", line 203, in __init__ assert reference_executable is not None AssertionError
As per https://docs.python.org/3/library/os.html#os.rename "The operation may fail on some Unix flavors if src and dst are on different filesystems." I came across this trying to use csv2parquet across different mounts...
Adds `--delimiter` parameter, loads the file using: ``` TABLE(dfs.`/path/to/file`(type=>'text', fieldDelimiter => '')) ```
Hi, I am using csv2parquet to convert csv into parquet, but I get following decode problem. ``` ./csv2parquet all_cities.csv all_cities.parquet Traceback (most recent call last): File "./csv2parquet", line 329, in...
Hello, Is there a config option or CLI parameter (or something) that allows me to specify compression options - like SNAPPY, for example? Thanks!
Add a failure if python < 3.5 found
For example the following line has 5 fields with space as field delimiter: This is example "of Multi-line Field In One Csv" record