Fokko Driesprong

Results 140 issues of Fokko Driesprong

In Iceberg the tables are projected using field-IDs. Even if the column is renamed (and Iceberg is lazy, so existing data will not be rewritten), it should correct map the...

Most IO implementations allow the `file://` prefix to indicate that the data is local. Would be great if DuckDB can support this as well 👍

Iceberg has support for [hidden partitioning](https://iceberg.apache.org/docs/latest/partitioning/). Data written to a partitioned table will be split up based on the column and the [applied transform](https://iceberg.apache.org/spec/): ![image](https://github.com/duckdblabs/duckdb_iceberg/assets/1134248/6e715fb5-1e2d-4d25-8aa6-68b920b19749) A partitioned table can tremendously...

# Allow Flink to run without Hadoop This PR aims to remove Hadoop's `Configuration` class from the main code path, so we can also run Flink without having the Hadoop...

parquet
core
flink
build
stale

To assess the impact of the changes. Blocked by #8485

spark
parquet
arrow
flink
build
hive

### Feature Request / Improvement I noticed this one the other day: ``` TestDataFrameWrites > testFaultToleranceOnWrite() > format = orc FAILED java.io.IOException: Failed to delete temp directory /tmp/junit10947448527164022571. The following...

improvement
good first issue
spark

I'm not sure if I should ask this here, or at the `macros` repository. Consider the following setup: ```yaml # /mkdocs.yml site_name: Cats API nav: - Intro: 'index.md' - Authentication:...

### Describe the enhancement requested In Apache Iceberg we have support for the UUID type. I think it would be nice to also support this in (Py)Arrow natively instead of...

Type: enhancement
Component: Python

Not needed anymore :) Make sure you have checked _all_ steps below. ### Jira - [ ] My PR addresses the following [Parquet Jira](https://issues.apache.org/jira/browse/PARQUET/) issues and references them in the...