sling-cli
Iceberg support
Hi,
Are there any plans to add Apache Iceberg as a destination table format (on S3, GCS, etc.)?
Similar to the request that I've seen for Delta here
All the best
Todd
Thanks, adding some links:
- https://github.com/apache/iceberg-go
- https://pkg.go.dev/github.com/apache/arrow/go/[email protected]
- https://github.com/apache/arrow/tree/main/go
The easiest way may be to use DuckDB with its Iceberg extension and execute it behind the scenes: https://duckdb.org/docs/extensions/iceberg.html (the extension only does reads).
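For illustration, here is a rough sketch of what "executing DuckDB behind the scenes" could look like. It is not how sling does it. It only builds the SQL that DuckDB's Iceberg extension expects (`INSTALL iceberg`, `LOAD iceberg`, `iceberg_scan(...)`). The helper names `iceberg_read_sql` and `run_with_duckdb` and the table path are made up for this example, and actually running the statements assumes the third-party `duckdb` Python package is installed and the path is reachable.

```python
from typing import List, Optional


def iceberg_read_sql(table_path: str, limit: Optional[int] = None) -> List[str]:
    """Build DuckDB statements that read an Iceberg table.

    Hypothetical helper for illustration only; the last statement is the query.
    """
    # Escape single quotes so the path is a valid SQL string literal.
    escaped = table_path.replace("'", "''")
    query = f"SELECT * FROM iceberg_scan('{escaped}')"
    if limit is not None:
        query += f" LIMIT {int(limit)}"
    return ["INSTALL iceberg", "LOAD iceberg", query]


def run_with_duckdb(statements: List[str]):
    """Run the statements in an in-memory DuckDB and return the query rows.

    Assumes `pip install duckdb`; imported lazily so the SQL builder above
    works without it.
    """
    import duckdb

    con = duckdb.connect()
    for stmt in statements[:-1]:
        con.execute(stmt)  # install and load the extension
    return con.execute(statements[-1]).fetchall()


if __name__ == "__main__":
    # Placeholder path: point this at a real Iceberg table directory.
    for stmt in iceberg_read_sql("data/iceberg/my_table", limit=10):
        print(stmt + ";")
```

Reading from S3 or GCS would additionally need DuckDB's `httpfs` extension and credentials, which this sketch leaves out.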
StarRocks does reads and writes to Iceberg.
Alternatively, Iceberg support can be achieved through AWS Athena support, the way dlt supports it: https://dlthub.com/docs/dlt-ecosystem/destinations/athena#additional-destination-options
The problem is that supporting the Iceberg format alone is not enough. You also need to register those files/data in a Hive Metastore (HMS) so that SQL query engines like Trino and StarRocks can query the data stored in those formats.
Added in #357
See https://blog.slingdata.io/reading-apache-iceberg-data-with-sling
Closing.