Larry Booker
Add two methods of refreshing tables:
- Manual refreshing: the user specifies which snapshot to load and the engine will parse the snapshot to add/remove Iceberg data files as needed...
To support production use cases, we need the following operators (also found in #4424):
- median
- percentile
- var
- cor
- count_neg, count_pos
- cum_std

But also needed...
### Java, connecting to a RESTCatalog using MinIO

```
import io.deephaven.iceberg.util.*;

properties = new HashMap();
properties.put("type", "rest");
properties.put("uri", "http://rest:8181");
properties.put("io-impl", "org.apache.iceberg.aws.s3.S3FileIO");
properties.put("client.region", "us-east-1");
properties.put("s3.access-key-id", "admin");
properties.put("s3.secret-access-key", "password");
properties.put("s3.endpoint", "http://minio:9000");

adapter...
```
Currently, the following data types are unsupported when loading an Iceberg table:
- UUID
- LIST
- MAP
- STRUCT

We should be able to support UUID and LIST types....
Recent PRs (#6206 and #6143) changed the `agg` and `update_by` formula methods to accept multiple columns as input but limited the operators to a single input formula (e.g. `agg.formula(formula="out_b=min(X)")`). It would...
Initial provision is made to re-use a shared `GroupByChunkedOperator` in `FormulaMultiColumnChunkedOperator` and `FormulaChunkedOperator`, but more work is needed before this can be fully used.

1) We need to optimize the...
This PR implements proper NaN handling for mathematical aggregation operations on float and double types. The key change is that NULL values are ignored during aggregation, while NaN values "poison"...
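
The NULL-versus-NaN rule described above can be illustrated with a minimal pure-Python sketch. It does not use the engine's code or API: `poisoned_sum` is a hypothetical helper, `None` stands in for a NULL value, and returning `None` for an all-NULL input is an assumption made only for this illustration.

```python
import math
from typing import Iterable, Optional


def poisoned_sum(values: Iterable[Optional[float]]) -> Optional[float]:
    """Sum floats, skipping nulls and letting any NaN poison the result.

    Hypothetical helper for illustration only; None models a NULL value.
    """
    total = 0.0
    seen_value = False
    for v in values:
        if v is None:
            # NULL values are ignored by the aggregation.
            continue
        if math.isnan(v):
            # A single NaN makes the whole result NaN.
            return math.nan
        total += v
        seen_value = True
    # Assumption: an input with no non-null values yields a null result.
    return total if seen_value else None
```

With this sketch, `poisoned_sum([1.0, None, 2.0])` skips the null and sums the remaining values, while any input that contains a NaN returns NaN regardless of the other values.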