iceberg issues

Respect spark.catalog.currentDatabase instead of hardcoded "default"

3

In the event that a database isn't defined the behavior falls back to using a hardcoded "default" database. Instead we should respect spark.catalog.currentDatabase. I have a simple pull request prepared...

NJordan72

Convert TestReadProjection/TestSparkReadProjection to use Spark's InternalRow

1

In starting to look at working on Iceberg's schema evolution for ORC, the current test case is full of Avro's types/data structures. That doesn't work at all for ORC, because...

omalley

Implement column size metrics for ORC

1

This is blocked on [ORC-305](https://issues.apache.org/jira/browse/ORC-305) and a release that contains it.

omalley

Support timestamps with timezone for ORC

1

We need to support timestamps with timezone for ORC. This is blocked by [ORC-189](https://issues.apache.org/jira/browse/ORC-189) and a release that contains it.

omalley

rdblue

iceberg
iceberg copied to clipboard

Metadata

Respect spark.catalog.currentDatabase instead of hardcoded "default"

Convert TestReadProjection/TestSparkReadProjection to use Spark's InternalRow

Implement column size metrics for ORC

Support timestamps with timezone for ORC

ORC: validate values are not null for required columns

Support schema evolution for ORC

Prototype HLL buffers in manifest files to provide column distinct estimates.

← Metadata

Owner

Metadata

iceberg iceberg copied to clipboard

Metadata

Respect spark.catalog.currentDatabase instead of hardcoded "default"

Convert TestReadProjection/TestSparkReadProjection to use Spark's InternalRow

Implement column size metrics for ORC

Support timestamps with timezone for ORC

ORC: validate values are not null for required columns

Support schema evolution for ORC

Prototype HLL buffers in manifest files to provide column distinct estimates.

← Metadata

Owner

Metadata

iceberg
iceberg copied to clipboard