Write a blog post about implementing StringView in DataFusion

Open alamb opened this issue 1 year ago • 5 comments

As part of the InfluxData / @pauldix philosophy of "write cool software and tell people about it", we plan to tell people about the great work that went into making StringView work in DataFusion.

As part of https://github.com/apache/datafusion/issues/10918 and the intern process at InfluxData, @XiangpengHao plans to write a blog post about the experience and what it takes to do this kind of performance optimization

I am filing this ticket to give it visibility

We plan to publish the post on InfluxData's blog first, and then propose cross-posting it on the DataFusion blog.

If anyone is interested in reviewing or helping, please let us know

alamb avatar Jul 22 '24 15:07 alamb

@XiangpengHao -- can you possibly comment / take this issue (I can't assign it to you without you having first commented on it 😆 )

alamb avatar Aug 02 '24 19:08 alamb

take

XiangpengHao avatar Aug 03 '24 00:08 XiangpengHao

We are done with the draft. We expect it to be published in the next few weeks (it turned out to be a two-part series).

alamb avatar Aug 12 '24 10:08 alamb

In case anyone wants an overview of adding StringView to DataFusion, here is a presentation and slides:

Video: https://www.youtube.com/watch?v=RVLshX6fbds
Slides: https://drive.google.com/file/d/1Qqd8V6cfS9rSQ_-JrinasQJwI79qlUEV/view?usp=drive_link

Blog is coming soon

alamb avatar Aug 16 '24 11:08 alamb

Part 1: https://www.influxdata.com/blog/faster-queries-with-stringview-part-one-influxdb/

alamb avatar Aug 27 '24 13:08 alamb

Part 2: https://www.influxdata.com/blog/faster-queries-with-stringview-part-two-influxdb/

cc @XiangpengHao

I would like to also make a copy of this content on the DataFusion blog as it is entirely about DataFusion and might be easier to discover there

alamb avatar Sep 10 '24 16:09 alamb

I made a PR to add a copy of these blogs to the DataFusion site: https://github.com/apache/datafusion-site/pull/25

alamb avatar Sep 13 '24 11:09 alamb

Still waiting on a committer to approve https://github.com/apache/datafusion-site/pull/25 and then I will close this ticket

alamb avatar Sep 23 '24 16:09 alamb