paimon icon indicating copy to clipboard operation
paimon copied to clipboard

[Feature] Append table support distribution mode before write to reduce small files

Open eric666666 opened this issue 1 year ago • 0 comments

Search before asking

  • [X] I searched in the issues and found nothing similar.

Motivation

Currently Paimon's append table is written directly by Flink writer operator subtask which will cause too many small files. We can set some shuffle conditions before writing to reduce the generation of small files, especially stream processing.Such as key by partition.

Solution

No response

Anything else?

No response

Are you willing to submit a PR?

  • [X] I'm willing to submit a PR!

eric666666 avatar Jul 30 '24 06:07 eric666666