parquet-testing icon indicating copy to clipboard operation
parquet-testing copied to clipboard

Adding testing page for Dictionary page after non-dict data page in same column-chunk

Open mapleFU opened this issue 10 months ago • 2 comments

The spec doesn't disable the case that:

Dict Page | Dict Index Page | Data Page | Dict Index Page   

We need check:

  1. Did we allowing this?
  2. Should we add testing case for the case

mapleFU avatar Feb 05 '25 03:02 mapleFU

cc @wgtmac

mapleFU avatar Feb 05 '25 03:02 mapleFU

I think this is allowed from the spec: A column chunk might be partly or completely dictionary encoded. I'm not sure if any open source Parquet writer is able to produced such kind of data.

wgtmac avatar Feb 05 '25 05:02 wgtmac