amoro icon indicating copy to clipboard operation
amoro copied to clipboard

[Bug]: Unpartitioned tables in Mixed-Hive format generate a lot of .keep files

Open shendanfengg opened this issue 2 years ago • 1 comments

What happened?

image image image

As shown in the figure in the Mixed-Hive Format for unpartitioned tables each commit will generate an empty file /.keep, which will lead to the existence of a large number of useless empty directories in the table to cause pressure on the hdfs

Affects Versions

master

What engines are you seeing the problem on?

Core

How to reproduce

  1. Create a Mixed-Hive Table with out partition
  2. Write data normally and execute Self-Optimizing
  3. Observe the hdfs file for this table

Relevant log output

No response

Anything else

No response

Are you willing to submit a PR?

  • [X] Yes I am willing to submit a PR!

Code of Conduct

  • [X] I agree to follow this project's Code of Conduct

shendanfengg avatar Nov 30 '23 09:11 shendanfengg

Any update on this issue? @shendanfengg

zhoujinsong avatar Jun 20 '24 03:06 zhoujinsong

This issue has been automatically marked as stale because it has been open for 180 days with no activity. It will be closed in next 14 days if no further activity occurs. To permanently prevent this issue from being considered stale, add the label 'not-stale', but commenting on the issue is preferred when possible.

github-actions[bot] avatar Dec 18 '24 00:12 github-actions[bot]

This issue has been closed because it has not received any activity in the last 14 days since being marked as 'stale'

github-actions[bot] avatar Jan 01 '25 00:01 github-actions[bot]