[BUG] Cannot open file when duplicate lines appear
Parquet Viewer Version V3.0.0.1
Where was the parquet file created? It was created with pandas.
Sample File File type not allowed
Describe the bug I created a file with a very simple schema and saved it using pandas' DataFrame.to_parquet, but now I am unable to open this file.
Screenshots
tenant_id: [<class 'int'>]
mac_address: [<class 'str'>]
data_type: [<class 'str'>]
confidence: [<class 'int'>]
raw_data: [<class 'str'>]
result_data: [<class 'str'>]
source_data_types: [<class 'str'>]
timestamp: [<class 'float'>]
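For anyone trying to reproduce this, here is a minimal sketch of how a file with this schema and two identical rows might be built. Only the column names and Python types come from the listing above. The sample values, the `column_types` and `write_parquet` helpers, and the output path are hypothetical, and this is not necessarily how the original file was generated. The write step assumes pandas plus a parquet engine such as pyarrow is installed, and is skipped otherwise.

```python
# Hypothetical sample values; only the column names and types come from the report.
row = {
    "tenant_id": 1,
    "mac_address": "00:11:22:33:44:55",
    "data_type": "example",
    "confidence": 90,
    "raw_data": "{}",
    "result_data": "{}",
    "source_data_types": "example",
    "timestamp": 1700000000.0,
}
rows = [dict(row), dict(row)]  # two identical (duplicate) rows


def column_types(records):
    """Map each column name to the list of Python types found in that column."""
    found = {}
    for record in records:
        for name, value in record.items():
            found.setdefault(name, set()).add(type(value))
    # Sort by type name so the output is stable when a column mixes types.
    return {name: sorted(types, key=lambda t: t.__name__) for name, types in found.items()}


def write_parquet(records, path):
    """Write the records with pandas if it and a parquet engine are available.

    Returns True if the file was written, False if pandas or the engine is missing.
    """
    try:
        import pandas as pd
        pd.DataFrame(records).to_parquet(path, index=False)
    except ImportError:
        return False
    return True


# Print the per-column types in the same shape as the listing above.
for name, types in column_types(rows).items():
    print(f"{name}: {types}")
```

Calling `write_parquet(rows, "demo.parquet")` should then produce a small file to try in ParquetViewer, assuming the duplicate rows are what triggers the error.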
Any chance you could zip and share a sample parquet file? GitHub allows uploading .zip files so zipping the parquet file is an easy way to upload the file directly here.
Assuming the file is not corrupted, this type of error typically indicates a change in the parquet format that needs to also be applied to the Parquet.NET library. Example: https://github.com/aloneguid/parquet-dotnet/issues/603
Of course, thank you very much for your reply. Here is a demo file that consists of two rows. You can open it with pandas.
Hey @GuoweiTops, turns out this issue has been fixed since v3.2.1.
Please download the latest release to open your files: https://github.com/mukunku/ParquetViewer/releases