paimon icon indicating copy to clipboard operation
paimon copied to clipboard

[Feature] Support schema evolution for nested struct add new children fields.

Open gnailJC opened this issue 1 year ago • 1 comments

Search before asking

  • [X] I searched in the issues and found nothing similar.

Motivation

Support schema evolution about nested data type such as StructType(Spark). For example: Struct<name: String, age: Int> to Stuct<name: String, age: Int, phone: Str>

Solution

Anything else?

Are you willing to submit a PR?

  • [ ] I'm willing to submit a PR!

gnailJC avatar Aug 08 '24 12:08 gnailJC

@JingsongLi Are there any plans to support this feature?

gnailJC avatar Aug 13 '24 03:08 gnailJC

I've been looking to add support for schema evolution of nested types for my use case in Flink CDC API. My plan is as follows

  • Extend the checks in canConvert to return the appropriate ConvertAction for nested types broadly following the Avro backwards compatibility schema resolution rules.
  • The 1.0.1 release introduced support for compatible schema evolution of nested types but currently only works for FlinkSQL. We can plug into the implementation of various NestedColumnModifier in SchemaManager from flink-cdc.
  • Fix https://github.com/apache/paimon/issues/5640

0dunay0 avatar May 20 '25 11:05 0dunay0