paimon
paimon copied to clipboard
Apache Paimon is a lake format that enables building a Realtime Lakehouse Architecture with Flink and Spark for both streaming and batch operations.
### Purpose The serialization entry of cdc kafka_table/database_sync is CdcJsonDeserializationSchema, but CdcJsonDeserializationSchema#deserialize returns CdcSourceRecord(JsonNode). When users extend other format, there will be limitations. Most data serialization is based on byte[],...
Compatible with Spark4
### Purpose When call `findOverlappedSnapshots(List taggedSnapshots, long beginInclusive, long endExclusive)`. If beginInclusive is 10, endExclusive is 15, the range is [10, 15), and the taggedSnapshots is [8, 9, 11, 12,...
### Purpose In #3731 we introduce `IcebergCommitCallback` to create Iceberg metadata. In this PR we improve this class so that it can reuse Iceberg manifest files created before to create...
### Purpose Bump testcontainers-java to latest release (1.19.8) https://github.com/testcontainers/testcontainers-java/releases https://java.testcontainers.org ### Tests ### API and Format ### Documentation
### Purpose cdc schema change shoud add column comment Linked issue: close #3666 ### Tests testSchemaEvolutionWithComment ### API and Format ### Documentation
### Purpose Linked issue: close #3654 ### Tests ### API and Format ### Documentation
### Purpose Linked issue: close #3599 ### Tests - nothing to change ### API and Format - change cdc pom to org.apache.flink - change package name **com.ververica.cdc** to **org.apache.flink** ###...
### Purpose Linked issue: close #xxx ### Tests ### API and Format ### Documentation
### Purpose Linked issue: close #3521 Support flink cache lookup operator ### Tests Added test case LookupJoinITCase#testLookupTableWithCache ### API and Format no ### Documentation no