potatochipcoconut

Results: 15 comments by potatochipcoconut

Hello, I'm interested in this feature and am trying to test out the strategy in a lambda environment. I am getting the following error during deserialization and am unsure how to...

@koenvo how's it looking? I'm also interested in this feature.

Thanks @koenvo, looking forward to it!

@Fokko just curious, any ETA on when this would land in the next release?

@koenvo just tried this out but wondering if it'd be possible to have upsert available while updating [snapshot](https://github.com/apache/iceberg-python/blob/main/pyiceberg/table/update/snapshot.py#L604)? I'm hoping to implement upsert() for a workflow that uses the distributed...

I see, hmm, dang yeah I have a concern because I'm using a FIFO queue to process the data, but those have a limitation of 10 items per batch, so...

FYI, I tried this out in a distributed write scenario where data files are serialized, then deserialized as described in the link above. This is the result when trying to do `txn.upsert()`:
```
[ERROR]...
```

Came up with a naive attempt, open to feedback. Not sure how it would handle e.g. int vs long, float vs double, etc.
```
import builtins
import datetime
import uuid
from...
```

@choucavalier agreed, it seems like a natural enhancement. Possibly it should be available directly through pyiceberg?

@choucavalier I started playing with your lib and it seems like generally what I was looking for, though I ran into issues around UUID (opened issues there and in duckdb). Locally I integrated into my...