Andrew Lamb
Andrew Lamb
There is a blog post about this from duckdbL https://duckdb.org/2022/05/27/iejoin.html
> I am willing to have a try if nobody is working on this. I don't know of anyone working on this. Maybe @Dandandan @liukun4515 or @ozankabak know of others...
BTW I think "Range Join" is pretty similar in spirit and does not require special SQL syntax https://www.vertica.com/blog/what-is-a-range-join-and-why-is-it-so-fastba-p223413/ https://www.vertica.com/docs/9.2.x/HTML/Content/Authoring/AnalyzingData/Queries/Joins/RangeJoins.htm#:~:text=Vertica%20provides%20performance%20optimizations%20for,two%20columns%20of%20another%20table. I think there is some non trivial overlap with the notion...
This topic came up at the recent DataFusion meetup in San Franciso as something of interest. Is there anyone willing to help drive / lead this project? I can't offer...
> We can make versions of join algorithms that will take that data ordering and execute even more efficiently than the standard hash join this query compiles into. FWIW this...
> [1] @alamb, writing up this example makes me think it could be a good idea to file ticket to support order specification for memory tables too. I agree --...
We could contemplate implementing both 🤔 Specifically, if we got the DataFusion engine to be able to identify range joins in general and `ASOF` joins are special case, we could...
FWIW this came up at InfluxData recently and we are considering investing more in this area. I will keep this ticket updated
I think the blog posts / code from the systems in this ticket are a good place to start ALso @my-vegetable-has-exploded had some code here that I don't think ever...
> Perhaps we could consider implementing them as plugins to reduce the review pressure? (Though I don't know how to using plugin in datafusion.) I agree that having a better...