PySpark-Confluent-Kafka-Apache-Drill- icon indicating copy to clipboard operation
PySpark-Confluent-Kafka-Apache-Drill- copied to clipboard

A code-based tutorial for production level data streaming with PySpark plus Optimus for data cleaning, Confluent Kafka, & Apache Drill using Docker and Cassandra (NoSQL DB) for storage; This allows f...

PySpark + Optimus, Confluent Kafka, Apache Drill, Cassandra/NoSQL + Docker code example

A code-based tutorial on setting up production grade data streams with PySpark, Optimus, Confluent Kafka, & Drill using Docker, with Cassandra (NoSQL) as storage.

(See code and README.md's in nested folders)