Ken, Wang
Ken, Wang
I will take care of this ticket.
1. Support Cache Table Cmd 2. Cache location aware scheduling 3. Cache files management across Cluster, cache data file placement policy. 4. Rewrite Physical plan and replace remote files with...
Does iceberg rust support hadoop catalog? I only see `s3tables` in the code base.
OpenDAL的HDFS写路径问题挺多的
而且OpenDAL套路有点深,问题很难triage
I remember the filters will be added to wrong side. I can work on this and provide a fix.
One quick question regarding this, after those dataset are copied to each executor, should they kept in-memory or spilled to disk, if keep them in memory for a while, memory...
@Dandandan Sounds nice. There are couple of things need to do to support the broadcasting exchange. 1. Rpc protocols to efficiently do broadcasting, something similar to Spark's TorrentBroadcast 2. Executor...
Today, For partitioned hash join, DataFusion already support CollectLeft model, I think it is similar to the Broadcast HashJoin. I do not get a chance to test it on Ballista...
Add a design doc for further discussion. https://docs.google.com/document/d/17J9H6gGBVktmRAFYNQu-v52QUUPlghRnVLIZC3mFYFY/edit?usp=sharing