[BUG] Cannot open file when duplicate lines appear

Open GuoweiTops opened this issue 7 months ago • 2 comments

Parquet Viewer Version V3.0.0.1

Where was the parquet file created? created it through pandas

Sample File File type not allowed

Describe the bug I created a DataFrame with a very simple schema and saved it using pandas' DataFrame.to_parquet, but I am unable to open the resulting file in ParquetViewer.

Screenshots [Image]

tenant_id: [<class 'int'>]
mac_address: [<class 'str'>]
data_type: [<class 'str'>]
confidence: [<class 'int'>]
raw_data: [<class 'str'>]
result_data: [<class 'str'>]
source_data_types: [<class 'str'>]
timestamp: [<class 'float'>]

GuoweiTops avatar Jun 03 '25 07:06 GuoweiTops

Any chance you could zip and share a sample parquet file? GitHub allows uploading .zip files so zipping the parquet file is an easy way to upload the file directly here.

Assuming the file is not corrupted, this type of error typically indicates a change in the parquet format that needs to also be applied to the Parquet.NET library. Example: https://github.com/aloneguid/parquet-dotnet/issues/603
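One quick way to rule out basic corruption before suspecting a format change is to check the file's framing. The sketch below relies only on the fact that an unencrypted Parquet file starts and ends with the 4-byte marker `PAR1`, with a 4-byte little-endian footer length just before the trailing marker. The function name `check_parquet_framing` is made up for illustration, and passing this check does not prove the file is fully readable.

```python
import struct
from pathlib import Path

MAGIC = b"PAR1"  # 4-byte marker at both the start and end of a Parquet file


def check_parquet_framing(path):
    """Return (ok, message) after checking magic bytes and footer length.

    Hypothetical helper: it inspects framing only, not the schema or data.
    """
    size = Path(path).stat().st_size
    # Smallest possible layout: head magic + footer length + tail magic.
    if size < 12:
        return False, "file too small to be Parquet"
    with open(path, "rb") as f:
        head = f.read(4)
        f.seek(-8, 2)  # last 8 bytes: footer length (little-endian) + magic
        tail = f.read(8)
    if head != MAGIC:
        return False, "missing PAR1 header"
    if tail[4:] != MAGIC:
        return False, "missing PAR1 trailer (file may be truncated)"
    (footer_len,) = struct.unpack("<I", tail[:4])
    # The footer must fit between the head magic and the final 8 bytes.
    if footer_len == 0 or footer_len > size - 12:
        return False, "footer length does not fit in the file"
    return True, f"framing looks valid, footer is {footer_len} bytes"
```

Calling `check_parquet_framing("demo.parquet")` (the path is a placeholder) returns a boolean and a short explanation. If the framing is valid and pandas can read the file, a reader-library incompatibility becomes the more likely cause.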

mukunku avatar Jun 03 '25 12:06 mukunku

parquetdemo.zip

Of course, and thank you very much for your reply. This is a demo file consisting of two rows. You can open it with pandas.

GuoweiTops avatar Jun 04 '25 01:06 GuoweiTops

Hey @GuoweiTops, turns out this issue has been fixed since v3.2.1.

Please download the latest release to open your files: https://github.com/mukunku/ParquetViewer/releases

mukunku avatar Aug 10 '25 16:08 mukunku