hudi icon indicating copy to clipboard operation
hudi copied to clipboard

[HUDI-7244] Ensure HoodieFileGroupReader.close() is called in spark

Open jonvex opened this issue 1 year ago • 5 comments

Change Logs

iter.map as well as iterator.toScala were not calling close. We now don't use those.

Impact

Don't leak memory (at least here)

Risk level (write none, low medium or high below)

low

Documentation Update

N/A

Contributor's checklist

  • [ ] Read through contributor's guide
  • [ ] Change Logs and Impact were stated clearly
  • [ ] Adequate tests were added if applicable
  • [ ] CI passed

jonvex avatar Dec 20 '23 18:12 jonvex

@jonvex , the tests are failing.

linliu-code avatar Dec 21 '23 00:12 linliu-code

Looks promising, the OOM is gone, the test failures look not related.

danny0405 avatar Dec 21 '23 02:12 danny0405

Did we chase down every reader (base files, log files, iterator) part or the new file group and ensured this is the only gap ? @jonvex @linliu-code

I haven't. I will try to do that later.

linliu-code avatar Dec 21 '23 17:12 linliu-code

Did we chase down every reader (base files, log files, iterator) part or the new file group and ensured this is the only gap ? @jonvex @linliu-code

I haven't. I will try to do that later.

I checked all the log blocks and the reader context. Could you check the record buffers?

jonvex avatar Dec 21 '23 17:12 jonvex

CI report:

  • 33a87e77b985a8fd3fe0a6a997059ee20fbedb8b UNKNOWN
  • 9819ca4db7b4ab9f2476aecc753e3fcc09c7cb7a Azure: FAILURE
Bot commands @hudi-bot supports the following commands:
  • @hudi-bot run azure re-run the last Azure build

hudi-bot avatar Jan 02 '24 19:01 hudi-bot

Azure CI is passsing: https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=21791

jonvex avatar Jan 02 '24 22:01 jonvex