desbordante-core
Desbordante is a high-performance data profiler that can discover many different patterns in data using various algorithms. It also allows running data cleaning scenarios using these algor...
Add tests for CLI option parsing and help pages
Add a script for testing examples (without gfd). Bindings now return a sorted list of FDs (only for algorithms inherited from FDAlgorithm). Update README to match the current output
I've made this error when remaking the bindings and now see it copied by others. Hopefully, not after this.
The TANE algorithm implementation was missing key pruning (supposedly because of AUCC discovery, which required a much larger search space). Since we no longer need AUCC discovery in this algorithm, original key pruning...
Implements the CORDS algorithm for mining soft functional dependencies and correlations
Generalize Tane and PFDTane and add tests. To check whether the refactoring caused any performance loss, the following experiments were performed. The discovery task was run as `cli.py --task=afd...
Implement a constructor to create a Statistic object without passing a pointer to a Type object
The algorithm is still unfinished, but it's usable now.