PySpark-Confluent-Kafka-Apache-Drill-
PySpark-Confluent-Kafka-Apache-Drill- copied to clipboard
A code-based tutorial for production level data streaming with PySpark plus Optimus for data cleaning, Confluent Kafka, & Apache Drill using Docker and Cassandra (NoSQL DB) for storage; This allows f...
PySpark + Optimus, Confluent Kafka, Apache Drill, Cassandra/NoSQL + Docker code example
A code-based tutorial on setting up production grade data streams with PySpark, Optimus, Confluent Kafka, & Drill using Docker, with Cassandra (NoSQL) as storage.
(See code and README.md's in nested folders)