awesome-arrow-r
awesome-arrow-r copied to clipboard
Awesome resources for learning more about Apache Arrow
Awesome Arrow πΉ
Awesome resources for learning more about things relating to Apache Arrow, focussed on the R package arrow.
If you have any suggestions for other resources to add here, please submit a PR!
Key:
π©βπ« Workshop
π Blog post
π½οΈ Video
ποΈ Slides
Official docs
General overview
- "Larger-Than-Memory Data Workflows with Apache Arrow" - UseR! 2022 conference workshop π©βπ«
- "Doing More with Data: An Introduction to Arrow for R Users" by Danielle Navarro π½οΈ
- "Getting started with Apache Arrow" by Danielle Navarro π
- "Efficient Data Analysis on Larger-than-Memory Data with DuckDB and Arrow" by Tom Mock π½οΈ
- "Bigger data with arrow and duckdb" by Tom Mock & Edgar Ruiz ποΈ
- "New Directions for Apache Arrow" by Wes McKinney π½οΈ
- "Bigger Data With Ease Using Apache Arrow" by Neal Richardson π½οΈ
- "Apache Arrow: Enabling Data Engineering Tasks in R" by Ian Cook π½οΈ
Data types and Arrow objects
- "Data serialisation in R" by Danielle Navarro π
- "Data types in Arrow and R" by Danielle Navarro π
- "Arrays and tables in Arrow" by Danielle Navarro π
Arrow bindings/Acero
- "Binding Apache Arrow to R" by Danielle Navarro π
- "Arrow New Feature Showcase: show_exec_plan()" by Nic Crane π
File formats and partitioning
- "Creating an Arrow dataset: An exploration of the file formats that Arrow can read and write." by FranΓ§ois Michonneau π
- "Creating an Arrow dataset (part 2): How does partitioning impact query performance?" by FranΓ§ois Michonneau π
- "Understanding the Parquet file format" by Colin Gillespie π
Geoarrow
- "Building Bridges: Arrow, Parquet, and Geospatial Computing" by Dewey Dunnington π
- "Accelerating geospatial computing using Apache Arrow" by Dewey Dunnington π½οΈ
- "Accelerating Geospatial Computing in R and Python Using Apache Arrow" by Dewey Dunnington and Joris Van den Bossche π½οΈ