Data Science topic
Data science is an inter-disciplinary field that uses scientific methods, processes, algorithms, and systems to extract knowledge from structured and unstructured data. Data scientists perform data analysis and preparation, and their findings inform high-level decisions in many organizations.
data-science-ipython-notebooks
Data science Python notebooks: Deep learning (TensorFlow, Theano, Caffe, Keras), scikit-learn, Kaggle, big data (Spark, Hadoop MapReduce, HDFS), matplotlib, pandas, NumPy, SciPy, Python essentials, AW...
pyspark-cheatsheet
🐍 Quick reference guide to common patterns & functions in PySpark.
auto_ml
[UNMAINTAINED] Automated machine learning for analytics & production
boltons
🔩 Like builtins, but boltons. 250+ constructs, recipes, and snippets which extend (and rely on nothing but) the Python standard library. Nothing like Michael Bolton.
logdissect
CLI utility and Python module for analyzing log files and other data.
dsr
Introduction to Data Science with R (Sciences Po, Paris, 2023)
awesome-bigdata
A curated list of awesome big data frameworks, ressources and other awesomeness.
prosto
Prosto is a data processing toolkit radically changing how data is processed by heavily relying on functions and operations with functions - an alternative to map-reduce and join-groupby
awesome-computer-science-opportunities
An awesome list of events and fellowship opportunities for Computer Science students
Hello-Kaggle-Guide
For someone who is new at Kaggle