hudi
hudi copied to clipboard
[HUDI-4881] Push down filters if possible when syncing partitions to Hive
Change Logs
If possible, try to push down filters to Hive to avoid fetching too many partitions.
Impact
Describe any public API or user-facing feature change or any performance impact.
Risk level: low
Choose one. If medium or high, explain what verification was done to mitigate the risks.
Contributor's checklist
- [ ] Read through contributor's guide
- [ ] Change Logs and Impact were stated clearly
- [ ] Adequate tests were added if applicable
- [ ] CI passed
Canceling the CI run to prioritize release blockers. Apologies.
@hudi-bot run azure
@codope @yihua @alexeykudinkin @xushiyan Hi, could you plz take a look this improvement?
Gentle ping @codope @yihua @xushiyan
@boneanxs apologies for the delay. will check soon
@alexeykudinkin @xushiyan could you please review the new commit? The test failure is 137, not relate to this pr.
@hudi-bot run azure
CI report:
Bot commands
@hudi-bot supports the following commands:@hudi-bot run azurere-run the last Azure build
@hudi-bot run azure
CI report:
- 18279aff65227373e172951404c192de6846cf59 Azure: SUCCESS
Bot commands
@hudi-bot supports the following commands:@hudi-bot run azurere-run the last Azure build
@alexeykudinkin Hey all comments are addressed, please take a look.
@boneanxs i approved it. @xushiyan any more comments from your end?
@xushiyan gentle ping...
@boneanxs sorry for the delay. A follow-up suggestion: the partition filter and expression classes look very generic and we can consider putting them in sync-common for other catalog sync implementations, which support partition sync too
@xushiyan Thanks for merging this. Sure, I will do this in a follow-up pr.