Consolidate test data someplace with clear geoparquet versioning
It would be nice to have example stac-geoparquet files clearly stored by parquet version someplace:
test-data
├── 0.4
│ └── naip.parquet
└── 1.1
└── 100-sentinel-2-items.parquet
Currently the test file in this repository is geoparquet v0.4: https://github.com/stac-utils/stac-geoparquet/blob/main/tests/data/naip.parquet
I was thrown off using the v0.4 file earlier (https://github.com/stac-utils/stac-geoparquet/discussions/83#discussioncomment-11387418) and GDAL also points to this file as an example: https://gdal.org/en/stable/drivers/raster/gti.html#stac-geoparquet-support, even though it only in theory supports GeoParquet 1+
Some other 1.1.0 examples that I'm aware of off the top of my head. I'm sure there are others being used across projects!
-
https://raw.githubusercontent.com/developmentseed/labs-375-stac-geoparquet-backend/refs/heads/main/data/naip.parquet
-
https://github.com/stac-utils/rustac-py/raw/refs/heads/main/data/100-sentinel-2-items.parquet
Adding (versioned) examples sounds good. I normally don't like large binary files in git repos, but I imagine that these can be very small: just the metadata (which is really what matters) and a row or two for demonstration.
Or a stac-geoparquet-data repo?
https://github.com/stac-utils/stac-geoparquet-data is created, if anyone wants to make a PR :)