iceberg-python icon indicating copy to clipboard operation
iceberg-python copied to clipboard

Apache PyIceberg

Results 402 iceberg-python issues
Sort by recently updated
recently updated
newest added

Support writes with TimeTransforms. TimeTransforms are can be supported natively within pyarrow as pyarrow compute functions, without requiring conversions back and forth between Arrow and Python data types.

- defined reading Iceberg tables using the snowflake catalog. - snowflake catalog is pretty much read only, so adding primarily read only ops. - refer snowflake iceberg sdk read guide:...

### Apache Iceberg version main (development) ### Please describe the bug 🐞 When using the `add_files` table api, the parquet metadata needs to be read and a mapping of `Dict[str,...

### Feature Request / Improvement Python 3.12 has been prereleased, and it would be good to support it: https://www.python.org/downloads/ Migrated from https://github.com/apache/iceberg/issues/8499

### Feature Request / Improvement `register_table` is currently the only Catalog method not implemented https://github.com/apache/iceberg-python/blob/main/pyiceberg/catalog/glue.py#L407 Also the docs are misleading because this method is shown in the docs and doesn’t...

### Feature Request / Improvement Support Maintenance operations on PyIceberg: https://iceberg.apache.org/docs/1.4.0/maintenance All operations except for data file compaction are metadata-only or file system operations, so supporting them on PyIceberg may...

Bumps [pytest-checkdocs](https://github.com/jaraco/pytest-checkdocs) from 2.10.1 to 2.13.0. Changelog Sourced from pytest-checkdocs's changelog. v2.13.0 Features Updated collection to prefer 'file_path' over 'path'. (#23) v2.12.0 Features Rely on jaraco.packaging.metadata for loading metadata. (#19)...

dependencies
python

### Question I am just appending to the same table using the same data, ``` try: catalog.create_table("aemo.scada",schema=df.schema) except: catalog.load_table("aemo.scada").append(df) ``` but I am getting this error

### Feature Request / Improvement currently it seems `con =catalog.load_table('aemo.scada').scan().to_duckdb(table_name="scada") ` accept only 1 table, it will be nice to have something like `con =catalog.load_namespace('aemo').scan().to_duckdb()` which will expose the whole...

### Feature Request / Improvement It would be nice to eliminate the `sleep 10` and instead use health checks for the underlying containers. https://github.com/apache/iceberg-python/blob/188e2c4d63f294fc55351916fb8d62a5a4401d3d/Makefile#L43 The sleep here is a proxy...

stale