hudi icon indicating copy to clipboard operation
hudi copied to clipboard

[HUDI-4582] Support batch synchronization of partition to hive metastore to avoid timeout with --sync-mode="hms" and use-jdbc=false

Open honeyaya opened this issue 3 years ago • 5 comments

Change Logs

Describe context and summary for this change. Highlight if any code was copied.

What is the purpose of the pull request

when we try to sync 11w partitions to hive by using HiveSyncTool(--sync-mode="hms" and use-jdbc=false) with timeout error.

With https://issues.apache.org/jira/browse/HUDI-2116, this only solved --sync-mode = jdbc with the parameter: HIVE_BATCH_SYNC_PARTITION_NUM, and I want to extend this to hms mode.

Contributor's checklist

  • [ ] Read through contributor's guide
  • [ ] Change Logs and Impact were stated clearly
  • [ ] Adequate tests were added if applicable
  • [ ] CI passed

honeyaya avatar Aug 09 '22 10:08 honeyaya

@hudi-bot run azure

@hudi-bot run azure

run azure

honeyaya avatar Aug 10 '22 02:08 honeyaya

@hudi-bot run azure

@hudi-bot run azure

run azure

honeyaya avatar Aug 10 '22 04:08 honeyaya

@jsbali @yihua could you help review this pr, thanks

honeyaya avatar Aug 10 '22 04:08 honeyaya

@hudi-bot run azure

@hudi-bot run azure

run azure

honeyaya avatar Aug 11 '22 06:08 honeyaya

@honeyaya please also simplify the pr title and add details in change logs section.

xushiyan avatar Aug 30 '22 03:08 xushiyan

CI report:

  • 96040212b1e5607f6822e217e81fbcee7cf79af7 Azure: SUCCESS
Bot commands @hudi-bot supports the following commands:
  • @hudi-bot run azure re-run the last Azure build

hudi-bot avatar Aug 31 '22 05:08 hudi-bot