Larry Booker

Results 12 issues of Larry Booker

Add two methods of refreshing tables:
- Manual refreshing: the user specifies which snapshot to load, and the engine parses the snapshot to add/remove Iceberg data files as needed...

query engine
DocumentationNeeded
ReleaseNotesNeeded

To support production use cases, we need the following operators (also found in #4424):
- median
- percentile
- var
- cor
- count_neg, count_pos
- cum_std

But also needed...

feature request
query engine
core
2023_unscheduled

### Java, connecting to a RESTCatalog using MinIO

```
import io.deephaven.iceberg.util.*;

properties = new HashMap();
properties.put("type", "rest");
properties.put("uri", "http://rest:8181");
properties.put("io-impl", "org.apache.iceberg.aws.s3.S3FileIO");
properties.put("client.region", "us-east-1");
properties.put("s3.access-key-id", "admin");
properties.put("s3.secret-access-key", "password");
properties.put("s3.endpoint", "http://minio:9000");
adapter...
```

DocumentationNeeded
ReleaseNotesNeeded

bug
NoDocumentationNeeded
ReleaseNotesNeeded

Currently, the following data types are unsupported when loading an Iceberg table:
- UUID
- LIST
- MAP
- STRUCT

We should be able to support UUID and LIST types....

feature request
triage

Recent PRs (#6206 and #6143) changed the `agg` and `update_by` formula methods to accept multiple columns as input but limited the operators to a single input formula (e.g. `agg.formula(formula="out_b=min(X)")`). It would...

feature request
triage

Initial provision is made to re-use a shared `GroupByChunkedOperator` in `FormulaMultiColumnChunkedOperator` and `FormulaChunkedOperator`, but more work is needed before this can be fully used.

1) We need to optimize the...

feature request
query engine
core

This PR implements proper NaN handling for mathematical aggregation operations on float and double types. The key change is that NULL values are ignored during aggregation, while NaN values "poison"...

NoDocumentationNeeded
ReleaseNotesNeeded
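
The NULL-versus-NaN distinction in the last summary can be illustrated with a small standalone sketch. This is plain Python, not the engine's implementation. `agg_sum` is a hypothetical helper, `None` stands in for NULL, and two behaviors are assumptions because the summary is truncated: that a NaN input makes the whole result NaN, and that an all-NULL input yields NULL.

```python
import math
from typing import Iterable, Optional


def agg_sum(values: Iterable[Optional[float]]) -> Optional[float]:
    """Sum floats, skipping NULLs (None) and letting any NaN poison the result."""
    total = 0.0
    seen_value = False
    for v in values:
        if v is None:
            # NULL: ignored, contributes nothing to the aggregate.
            continue
        if math.isnan(v):
            # NaN: assumed to poison the aggregate, so the result is NaN.
            return math.nan
        total += v
        seen_value = True
    # Assumption: an input with no non-NULL values aggregates to NULL.
    return total if seen_value else None


print(agg_sum([1.0, None, 2.0]))      # NULL is skipped
print(agg_sum([1.0, math.nan, None])) # NaN poisons the sum
print(agg_sum([None, None]))          # nothing to aggregate
```

The early return on NaN is a design shortcut: once one NaN has been seen, no later value can change the outcome under the assumed semantics.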