Oleks V

Results 186 comments of Oleks V

That actually was on my backlog couple of months. It is nice to support an array of files or globs

I'll try to take in 2 weeks if no one else beats me to it

I dont remember the roots, so wondering, can we investigate and use a single type system which is Arrow Types and get rid of other types. At the end of...

are we waiting to address any feedback on this PR?

Thanks @andygrove I'm planning to profile it. Just for reference Q72 using INNER and LEFT join types

Im running slightly changed Q72 in DF ``` select i_item_desc ,w_warehouse_name ,d1.d_week_seq ,sum(case when p_promo_sk is null then 1 else 0 end) no_promo ,sum(case when p_promo_sk is not null then...

![smj](https://github.com/user-attachments/assets/da7c7e58-e934-472b-8d7e-c86ba467d056) Looks like a lot of time spent on `arrow_select::take::take_impl` which is specific for filtered join. `take` gets filtered matched indices from the joined data

This is still relevant checked on TPCH Q21 SMJ without filter ``` Query 21 took 121.45100903511047 seconds ``` with filtered SMJ ``` Query 21 took 557.8474290370941 seconds ```