realtime-voting-data-engineering
realtime-voting-data-engineering copied to clipboard
This repository contains the code for a realtime election voting system. The system is built using Python, Kafka, Spark Streaming, Postgres and Streamlit. The system is built using Docker Compose to e...
Realtime Election Voting System
This repository contains the code for a realtime election voting system. The system is built using Python, Kafka, Spark Streaming, Postgres and Streamlit. The system is built using Docker Compose to easily spin up the required services in Docker containers.
System Architecture

System Flow

System Components
- main.py: This is the main Python script that creates the required tables on postgres (
candidates,votersandvotes), it also creates the Kafka topic and creates a copy of thevotestable in the Kafka topic. It also contains the logic to consume the votes from the Kafka topic and produce data tovoters_topicon Kafka. - voting.py: This is the Python script that contains the logic to consume the votes from the Kafka topic (
voters_topic), generate voting data and produce data tovotes_topicon Kafka. - spark-streaming.py: This is the Python script that contains the logic to consume the votes from the Kafka topic (
votes_topic), enrich the data from postgres and aggregate the votes and produce data to specific topics on Kafka. - streamlit-app.py: This is the Python script that contains the logic to consume the aggregated voting data from the Kafka topic as well as postgres and display the voting data in realtime using Streamlit.
Setting up the System
This Docker Compose file allows you to easily spin up Zookkeeper, Kafka and Postgres application in Docker containers.
Prerequisites
- Python 3.9 or above installed on your machine
- Docker Compose installed on your machine
- Docker installed on your machine
Steps to Run
- Clone this repository.
- Navigate to the root containing the Docker Compose file.
- Run the following command:
docker-compose up -d
This command will start Zookeeper, Kafka and Postgres containers in detached mode (-d flag). Kafka will be accessible at localhost:9092 and Postgres at localhost:5432.
Additional Configuration
If you need to modify Zookeeper configurations or change the exposed port, you can update the docker-compose.yml file according to your requirements.
Running the App
- Install the required Python packages using the following command:
pip install -r requirements.txt
- Creating the required tables on Postgres and generating voter information on Kafka topic:
python main.py
- Consuming the voter information from Kafka topic, generating voting data and producing data to Kafka topic:
python voting.py
- Consuming the voting data from Kafka topic, enriching the data from Postgres and producing data to specific topics on Kafka:
python spark-streaming.py
- Running the Streamlit app:
streamlit run streamlit-app.py
Screenshots
Candidates and Parties information

Voters

Voting

Dashboard

