[Feature] paimon_incremental_to_auto_tag return whole data if earlier tag does not exist

Open JackeyLee007 opened this issue 10 months ago • 0 comments

Search before asking

  • [x] I searched in the issues and found nothing similar.

Motivation

Tables with primary keys are initially ingested into Paimon by batch processing tasks, and then ingested in real time. The real-time ingestion can create tags periodically. For the real-time part, we can use the Paimon incremental query to get the difference between tags and process it incrementally. The initial data, on the other hand, cannot be retrieved through the incremental query. We have to use a totally different approach, such as selecting all data without a filter condition. That increases the complexity of the data processing.

Solution

It would be better if the Paimon incremental query returned the whole data when earlier tags are missing. Then we could use the same method to process both the initial part and the real-time part.

-- If no tag earlier than 2024-12-04 exists, return the whole data instead of an empty result.
SELECT * FROM paimon_incremental_to_auto_tag('tableName', '2024-12-04');
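
To make the requested semantics concrete, here is a minimal Python sketch of a toy model. It is not Paimon's implementation: the function `incremental_to_auto_tag`, the `full_on_missing` flag, and the in-memory `tags` mapping are all hypothetical names made up for illustration, and the model ignores deletes. It only contrasts the current behavior described above (empty result when no earlier tag exists) with the proposed one (whole data at the end tag).

```python
from typing import Dict, List, Tuple

Row = Tuple[int, str]  # (primary key, value); hypothetical row shape


def incremental_to_auto_tag(
    tags: Dict[str, Dict[int, str]],
    end_tag: str,
    full_on_missing: bool,
) -> List[Row]:
    """Toy model: `tags` maps a tag name (ISO date) to the table state at that tag."""
    end_state = tags[end_tag]
    # ISO date strings compare correctly as plain strings.
    earlier = [name for name in tags if name < end_tag]
    if not earlier:
        # Current behavior: empty result. Proposed behavior: the whole data.
        return sorted(end_state.items()) if full_on_missing else []
    start_state = tags[max(earlier)]
    # Rows that are new or changed between the previous tag and the end tag.
    return sorted(
        (pk, value)
        for pk, value in end_state.items()
        if start_state.get(pk) != value
    )


tags = {
    "2024-12-04": {1: "a", 2: "b"},
    "2024-12-05": {1: "a", 2: "c", 3: "d"},
}
first_current = incremental_to_auto_tag(tags, "2024-12-04", full_on_missing=False)
first_proposed = incremental_to_auto_tag(tags, "2024-12-04", full_on_missing=True)
second = incremental_to_auto_tag(tags, "2024-12-05", full_on_missing=True)
```

With the proposed behavior, the first tag yields the initial data and every later tag yields only the changes, so a single processing path covers both the batch-loaded part and the real-time part.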

Anything else?

No response

Are you willing to submit a PR?

  • [ ] I'm willing to submit a PR!

JackeyLee007 avatar Feb 16 '25 11:02 JackeyLee007