data-wrangling topic

List data-wrangling repositories

prosto

90
Stars
4
Forks
Watchers

Prosto is a data processing toolkit radically changing how data is processed by heavily relying on functions and operations with functions - an alternative to map-reduce and join-groupby

dasel

4.9k
Stars
112
Forks
Watchers

Select, put and delete data from JSON, TOML, YAML, XML and CSV files with a single tool. Supports conversion between formats and can be used as a Go package.

Data-science-best-resources

2.8k
Stars
965
Forks
Watchers

Carefully curated resource links for data science in one place

ml

63
Stars
16
Forks
Watchers

A 60 days+ streak of daily learning of ML/DL/Maths concepts through projects

datatest

288
Stars
15
Forks
Watchers

Tools for test driven data-wrangling and data validation.

OpenRefine

10.6k
Stars
1.9k
Forks
Watchers

OpenRefine is a free, open source power tool for working with messy data and improving it

Web-Database-Analytics

260
Stars
170
Forks
Watchers

Web scrapping and related analytics using Python tools

optimus

1.4k
Stars
233
Forks
Watchers

:truck: Agile Data Preparation Workflows made easy with Pandas, Dask, cuDF, Dask-cuDF, Vaex and PySpark

cracking-the-data-science-interview

3.2k
Stars
933
Forks
Watchers

A Collection of Cheatsheets, Books, Questions, and Portfolio For DS/ML Interview Prep

ModernDive_book

736
Stars
462
Forks
Watchers

Statistical Inference via Data Science: A ModernDive into R and the Tidyverse