Yujiang Zhong
Yujiang Zhong
> The "bucket" is one way to transform partitions in Iceberg. Yea, I think Iceberg refers to all methods of dividing datasets as partitioning. +1 on introducing the truncate transform....
cc @JingsongLi Hi, what do you think?
Hi @BsoBird Tired storage is one of the application scenarios. Implementing multi-location management has additional benefits, such as enabling a smoother transition of data from Hadoop to cloud object storage....
@BsoBird I refer to data administrators as the platform’s data administrators, not the data owners. Table owners can set cold data partition archiving options based on their needs, and the...
Although moving large amounts of data is a heavy load, the load for metadata changes is not high, so metadata commits should be relatively fast. This is similar to concurrent...
@BsoBird Well, that’s not what I meant. What I meant is that if Client-A and Client-B commit simultaneously, once one succeeds, the other commit should fail, but it should not...
@JingsongLi Thanks or replying. We are still in the very early stages of planning this solution, so we’d like to confirm with the community whether this approach is feasible. Once...
@wg1026688210 > Can the disscusion content of this issue serve as a low-cost solution for migrating Hive to a Paimon Append table? So that we don't need to copy data...
We ran into the same issue, but I believe it should already be fixed in the new Iceberg 1.10 release. This [commit](https://github.com/apache/iceberg/commit/30fd752f24d796ce548dbeef550e875941ce6fb7) removed the logic of reading the last-column-id, since...
@twuebi The fix PR is only included in version 1.10, which was released in Sep. And I’ve verified it. I backported it to a lower version in our internal setup(we...