Python: Add CLI support for adding existing parquet file to table.
Feature Request / Improvement
The request is for the CLI to be able to add a parquet file (created by another tool) to an existing Iceberg table. The closest equivalent functionality would be the add_files Spark procedure.
The idea is that some/any other tool can write a parquet file to a location in the table directory and then invoke the CLI to import that into the table with metadata-only operations. Specifically, the CLI just creates the manifests and snapshot metadata and can optionally validate the schema of the parquet file.
Motivation: this would be an easy way to increase composability of Iceberg with other tools and serve as a waypoint on the way to getting full support for adding data via the python SDK.
Query engine
No response
This would be great! I think it's a little ways out since we can't yet write metadata files. But I think this should be an early target because it is basically the append path.
This issue has been automatically marked as stale because it has been open for 180 days with no activity. It will be closed in next 14 days if no further activity occurs. To permanently prevent this issue from being considered stale, add the label 'not-stale', but commenting on the issue is preferred when possible.
This issue has been closed because it has not received any activity in the last 14 days since being marked as 'stale'