ParquetViewer icon indicating copy to clipboard operation
ParquetViewer copied to clipboard

[FEATURE-REQUEST] Ability to open partitioned files

Open tigerhawkvok opened this issue 4 years ago • 1 comments

Parquet Viewer Version

2.3.1.41849

Where was the parquet file created?

Pandas -> pyarrow

dfStore.to_parquet(BUILDINGS_OUTPUT_FILE, partition_cols= ["type"])

Sample File

pv_bugdemo.parquet.zip

Describe the bug

A partitioned file that is actually a folder with several subfiles should be supported. This probably involves checking if the "file" is actually a directory then traversing the tree to read the individual constituent files.

Note: This tool relies on the parquet-dotnet library for all the actual Parquet processing. So any issues where that library cannot process a parquet file will not be addressed by us. Please open a ticket on that library's repo to address such issues.

tigerhawkvok avatar Jun 03 '21 22:06 tigerhawkvok

This is a big feature to implement. Not sure if it can be done as it's really complicated. Leaving this ticket open for now in case anyone wants to tackle this behemoth.

For the time being, you'll have to open the files one by one. Or make sure you save them as a single file instead of a partitioned one.

mukunku avatar Jun 13 '21 16:06 mukunku

Support to open partitioned files has been finally added! https://github.com/mukunku/ParquetViewer/releases/tag/v2.6.0.2

mukunku avatar Mar 12 '23 22:03 mukunku

I could not open the sample files that are the first post with the latest version

After loading, the program closes completely

AFgh24 avatar Mar 28 '23 15:03 AFgh24