[HUDI-7243] TVF to query hudi timeline through spark sql
Change Logs
A new table-valued function (TVF), hudi_timeline_query(...), is added to support querying the Hudi timeline through spark-sql. The output is modeled on the 'Commits' command of hudi-cli, with some additional information.
A new relation, TimelineRelation, is added to support this functionality transparently. The relation implements the buildScan(...) method of the TableScan trait. It does not support filter or predicate push-down, so column filtering and predicate evaluation need to be done by the execution layer. This seems reasonable for an initial implementation, since the tool is mainly going to be used for debugging and introspection. The relation defines a fixed schema based on the information present in a Hudi timeline instant.
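To illustrate what a scan-only relation implies for callers, here is a minimal plain-Python model, not the actual Scala implementation in this PR. The class name TimelineRelationSketch, the execute helper, the column names, and the sample instants are all hypothetical; the real TimelineRelation schema may differ. The sketch only shows the division of work: the relation returns every row with every column, and the caller (standing in for Spark's execution layer) filters and projects afterwards.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List, Sequence, Tuple

# Hypothetical fixed schema; the real TimelineRelation columns may differ.
SCHEMA: Tuple[str, ...] = ("instant_time", "action", "state")

Row = Tuple[str, ...]


@dataclass(frozen=True)
class TimelineRelationSketch:
    """Scan-only relation: a fixed schema plus a full scan, nothing else."""

    instants: Sequence[Row]

    @property
    def schema(self) -> Tuple[str, ...]:
        return SCHEMA

    def build_scan(self) -> List[Row]:
        # Accepts no filters and no column list: every row, every column.
        return list(self.instants)


def execute(
    relation: TimelineRelationSketch,
    columns: Sequence[str],
    predicate: Callable[[Dict[str, str]], bool],
) -> List[Row]:
    """Stand-in for the execution layer: filter and project after the scan."""
    indexes = [relation.schema.index(c) for c in columns]
    result = []
    for row in relation.build_scan():
        named = dict(zip(relation.schema, row))
        if predicate(named):
            result.append(tuple(row[i] for i in indexes))
    return result


# Made-up sample instants, for illustration only.
relation = TimelineRelationSketch(
    instants=[
        ("20240101000000", "commit", "COMPLETED"),
        ("20240102000000", "deltacommit", "INFLIGHT"),
        ("20240103000000", "commit", "COMPLETED"),
    ]
)

completed = execute(
    relation,
    columns=["instant_time", "action"],
    predicate=lambda r: r["state"] == "COMPLETED",
)
```

The cost of this design is that the full timeline is always materialized before any filtering, which is acceptable when the row count is bounded by the number of instants rather than by the table's data size.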
Impact
A new TVF is added to query the Hudi timeline through spark-sql.
Risk level (write none, low, medium or high below)
Low
Documentation Update
TBD
Describe any necessary documentation update if there is any new feature, config, or user-facing change
- The config description must be updated if new configs are added or the default value of the configs are changed
- Any new feature or user-facing change requires updating the Hudi website. Please create a Jira ticket, attach the ticket number here and follow the instruction to make changes to the website.
Contributor's checklist
- [ ] Read through contributor's guide
- [ ] Change Logs and Impact were stated clearly
- [ ] Adequate tests were added if applicable
- [ ] CI passed
CI report:
- bb48cd2b3363f1a83248fa8d8ff655115c43a14e Azure: FAILURE
Bot commands
@hudi-bot supports the following commands:
- @hudi-bot run azure: re-run the last Azure build