feature-engineering topic
hamilton
Hamilton helps data scientists and engineers define testable, modular, self-documenting dataflows that encode lineage/tracing and metadata. Runs and scales everywhere Python does.
desbordante-core
Desbordante is a high-performance data profiler capable of discovering many different patterns in data using various algorithms. It also allows running data cleaning scenarios using these algor...
Projects-on-Data-Cleaning-and-Manipulation
This repository contains projects I have worked on for Data Cleaning and Manipulation in Python.
bubble_plot
Visualize linear and non-linear connections between numerical/categorical features (2D histogram with bubbles)
lambdo
Feature engineering and machine learning: together at last!
Home-Credit-Default-Risk-Recognition
The project provides a complete end-to-end workflow for building a binary classifier in Python to recognize the risk of housing loan default. It includes methods like automated feature engineering for...
Natural-Language-Processing-with-Machine-Learning
This repository builds a basic understanding of Natural Language Processing and Machine Learning tasks around it.
skrobot
skrobot is a Python module for designing, running and tracking Machine Learning experiments / tasks. It is built on top of the scikit-learn framework.
pic2vec
Lightweight Image Featurization Made Easy
Feature-Extractors-for-Video-Steganalysis
Provides the stego community with C/C++ implementations of selected feature extractors, mainly targeted at H.264 steganography.