hudi
hudi copied to clipboard
[HUDI-8370] Removed excessive `DataBucket::preWrite`
Change Logs
DataBucket::preWrite sets new current location, using fileID from DataBucket, only for first record in the bucket. But fileID is set only during DataBucket initialization, where we use passed first record. So, we actually don't need to call preWrite.
Impact
No impact
Risk level (write none, low medium or high below)
Low
Documentation Update
No need
Contributor's checklist
- [x] Read through contributor's guide
- [x] Change Logs and Impact were stated clearly
- [x] Adequate tests were added if applicable
- [x] CI passed
CI report:
- 953da4d1592ec741f44a237bc95384b0f96619c7 Azure: SUCCESS
Bot commands
@hudi-bot supports the following commands:@hudi-bot run azurere-run the last Azure build
@danny0405 ,
I've also reviewed lifecycle of current location for Flink stream write:
We don't need to call
preWrite(), it updates only fileID, which is similar for all records in the bucket. And each record has set current location.