Sanchit Kumar

Results: 7 repositories owned by Sanchit Kumar

goodreads_etl_pipeline

1.2k
Stars
210
Forks

An end-to-end GoodReads data pipeline for building a data lake, data warehouse, and analytics platform.

Udacity-Data-Engineering-Projects

1.4k
Stars
461
Forks

A few projects related to Data Engineering, including data modeling, infrastructure setup on the cloud, data warehousing, and data lake development.

Cloudera_Material

29
Stars
25
Forks

Cloudera_Material: Study material to help people prepare for the Cloudera CCA Spark and Hadoop Developer Exam (CCA175). Feel free to collaborate.

goodreads

27
Stars
5
Forks

🐍 Python wrapper for the Goodreads API 📚

Optimizing-Public-Transportation

26
Stars
12
Forks

A real-time event pipeline built around the Kafka ecosystem for the Chicago Transit Authority.

data-engineer-roadmap

15
Stars
10
Forks

Learnings from multiple companies in Silicon Valley: Netflix, Facebook, Google, and startups.

Big_Data_Project

15
Stars
10
Forks

Fake News Detection: feature extraction using vectorization (Count Vectorizer, TF-IDF Vectorizer, and Hash Vectorizer), followed by an ensemble model that classifies whether the news is fake or not.