[Feature] Exposing Paimon tags to Iceberg metadata

Open junmuz opened this issue 10 months ago • 1 comments

Search before asking

  • [x] I searched in the issues and found nothing similar.

Motivation

Currently, Paimon tags are not exposed in the Iceberg metadata when data is written in Iceberg compatibility mode. This means querying a historical tag may not work with AWS Athena. I am using the 1.1 Spark jars and have verified the behavior with Spark SQL; the steps are in the attached file. Right now the query fails with the following error:

25/02/26 08:25:01 ERROR SparkSQLDriver: Failed in [select * from default.cities version as of 'test_tag']
org.apache.iceberg.exceptions.ValidationException: Cannot find matching snapshot ID or reference name for version test_tag

paimon_tagging.txt

Solution

At a high level, the proposed solution is to expose Paimon tags in the Iceberg metadata as they are created, and to remove them from the Iceberg metadata once they are deleted.
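
To make the idea concrete, here is a minimal sketch in Python of how the tag sync could look. It is not Paimon's actual implementation. It assumes the tags end up in the `refs` map of the Iceberg table metadata (entries with `snapshot-id` and `type`, as in the Iceberg spec), that a Paimon snapshot ID can be used directly as the Iceberg snapshot ID, and that a tag is only exposed if its snapshot is present in the Iceberg metadata. The function name `sync_tag_refs` and its parameters are hypothetical.

```python
from typing import Dict, Set


def sync_tag_refs(
    iceberg_refs: Dict[str, dict],
    paimon_tags: Dict[str, int],
    iceberg_snapshot_ids: Set[int],
) -> Dict[str, dict]:
    """Return a new Iceberg `refs` map that mirrors the current Paimon tags.

    Hypothetical helper, for illustration only.
    iceberg_refs:         existing `refs` map from the Iceberg table metadata
    paimon_tags:          Paimon tag name -> snapshot ID (assumed to match Iceberg's)
    iceberg_snapshot_ids: snapshot IDs that exist in the Iceberg metadata
    """
    # Keep every non-tag ref (for example the `main` branch) untouched.
    # Dropping all existing tag refs here is what cleans up deleted tags.
    synced = {
        name: ref for name, ref in iceberg_refs.items() if ref.get("type") != "tag"
    }

    for name, snapshot_id in paimon_tags.items():
        if name in synced:
            # Name collides with a branch; do not overwrite it.
            continue
        if snapshot_id not in iceberg_snapshot_ids:
            # A ref must point at a snapshot the Iceberg metadata knows about.
            continue
        synced[name] = {"snapshot-id": snapshot_id, "type": "tag"}

    return synced


# Example: `old_tag` was deleted in Paimon and `test_tag` was created.
refs = {
    "main": {"snapshot-id": 3, "type": "branch"},
    "old_tag": {"snapshot-id": 1, "type": "tag"},
}
new_refs = sync_tag_refs(refs, {"test_tag": 2}, {1, 2, 3})
```

With refs like these in place, a query such as `select * from default.cities version as of 'test_tag'` would have a reference name to resolve instead of failing validation.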

Anything else?

No response

Are you willing to submit a PR?

  • [x] I'm willing to submit a PR!

junmuz avatar Feb 27 '25 13:02 junmuz

+1

JingsongLi avatar Mar 03 '25 06:03 JingsongLi

+1

wwj6591812 avatar May 07 '25 01:05 wwj6591812

One more PR for this issue: https://github.com/apache/paimon/pull/5666

Otherwise it is complete.

nickdelnano avatar May 26 '25 16:05 nickdelnano

Please close this issue; it is complete.

nickdelnano avatar Jul 24 '25 21:07 nickdelnano