Oleks V
Oleks V
That actually was on my backlog couple of months. It is nice to support an array of files or globs
I'll try to take in 2 weeks if no one else beats me to it
I dont remember the roots, so wondering, can we investigate and use a single type system which is Arrow Types and get rid of other types. At the end of...
are we waiting to address any feedback on this PR?
Thanks @andygrove I'm planning to profile it. Just for reference Q72 using INNER and LEFT join types
Im running slightly changed Q72 in DF ``` select i_item_desc ,w_warehouse_name ,d1.d_week_seq ,sum(case when p_promo_sk is null then 1 else 0 end) no_promo ,sum(case when p_promo_sk is not null then...
 Looks like a lot of time spent on `arrow_select::take::take_impl` which is specific for filtered join. `take` gets filtered matched indices from the joined data
This is still relevant checked on TPCH Q21 SMJ without filter ``` Query 21 took 121.45100903511047 seconds ``` with filtered SMJ ``` Query 21 took 557.8474290370941 seconds ```
Starting CI
Triggering CI