fast_eda
Fast data exploration using Sweetviz
Fast And Easy Data Exploration For Machine Learning
Where there is data smoke, there is business fire.
-- Thomas Redman

Table of Contents
- What is this repo about?
- Quick setup
- Example
- Let's connect!
What is this repo about?
Tired of spending too much time doing data exploration before training your Machine Learning models?
Looking for a faster way to understand data issues and patterns, before you dive into the fun part of training your ML model?
Wanna learn how to train better ML models by finding and fixing issues in your data?
In this repo you can find a short Python script that uses the Sweetviz library to do data exploration at the speed of light.
I also invite you to read the original article I published on my site:
Fast And Easy Data Exploration For Machine Learning
Quick setup
You need Python >= 3.7 and the Sweetviz library:
$ pip install sweetviz
Example
I attached a sample dataset for a churn prediction model that you can quickly explore with a one-liner.
$ python eda.py --file v1.csv --target Churn
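For readers curious about what a script like this might do under the hood, here is a minimal sketch. It is not the repo's actual eda.py: the function names, the --output flag, and the 0/1 encoding of a two-valued text target are assumptions made for illustration. The only Sweetviz calls used are analyze() and show_html().

```python
import argparse


def parse_args(argv=None):
    """Parse the two flags shown in the one-liner above."""
    parser = argparse.ArgumentParser(description="Quick EDA report with Sweetviz")
    parser.add_argument("--file", required=True, help="path to the CSV file")
    parser.add_argument("--target", default=None, help="name of the target column")
    # Hypothetical extra flag, not part of the command shown in this README.
    parser.add_argument("--output", default="report.html", help="HTML report path")
    return parser.parse_args(argv)


def build_report(csv_path, target=None, output="report.html"):
    """Load the CSV and write a Sweetviz HTML report to `output`."""
    # Imported lazily so argument parsing works even without these packages.
    import pandas as pd
    import sweetviz as sv

    df = pd.read_csv(csv_path)

    # Assumption: Sweetviz wants a numeric or boolean target, so a two-valued
    # text column (for example "Yes"/"No") is mapped to 0/1 before analysis.
    if (
        target is not None
        and not pd.api.types.is_numeric_dtype(df[target])
        and df[target].nunique() == 2
    ):
        first, second = sorted(df[target].dropna().unique())
        df[target] = df[target].map({first: 0, second: 1})

    report = sv.analyze(df, target_feat=target)
    report.show_html(filepath=output, open_browser=False)
    return output


def main(argv=None):
    args = parse_args(argv)
    path = build_report(args.file, args.target, args.output)
    print(f"Report written to {path}")


if __name__ == "__main__":
    import sys

    # Run only when flags are supplied, so importing the sketch is harmless.
    if len(sys.argv) > 1:
        main(sys.argv[1:])
```

Saved under a hypothetical name such as sketch_eda.py, it would be called the same way as the command above, and the resulting HTML file can be opened in any browser.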
Let's connect!
If you want to learn more about real-world ML topics and become a better data scientist,
subscribe to the datamachines newsletter.