apache-spark topic
azure-cosmosdb-spark
Apache Spark Connector for Azure Cosmos DB
pyspark-asyncactions
Asynchronous actions for PySpark
pyspark-stubs
Apache (Py)Spark type annotations (stub files).
spark-boilerplate
A boilerplate for spark projects with docker support for local development and scripts for emr support.
flintrock
A command-line tool for launching Apache Spark clusters.
albedo
A recommender system for discovering GitHub repos, built with Apache Spark
hal-9000
Automatically setup a productive development environment with Ansible on macOS
awesome-kafka
A list about Apache Kafka
awesome-tools
curated list of awesome tools and libraries for specific domains
Agile_Data_Code_2
Code for Agile Data Science 2.0, O'Reilly 2017, Second Edition