map-reduce topic
datastore-mapper
Appengine Datastore Mapper in Go
gleam
Fast, efficient, and scalable distributed map/reduce system, DAG execution, in memory or on disk, written in pure Go, runs standalone or distributedly.
prosto
Prosto is a data processing toolkit radically changing how data is processed by heavily relying on functions and operations with functions - an alternative to map-reduce and join-groupby
ThreadsX.jl
Parallelized Base functions
Transducers.jl
Efficient transducers for Julia
Spark-with-Python
Fundamentals of Spark with Python (using PySpark), code examples
poseidon
A search engine which can hold 100 trillion lines of log data.
python-bigdata
Data science and Big Data with Python
pypar
Efficient and scalable parallelism using the message passing interface (MPI) to handle big data and highly computational problems.
flox
Fast & furious GroupBy operations for dask.array