desbordante-core
Desbordante is a high-performance data profiler capable of discovering many different patterns in data using various algorithms. It also allows running data cleaning scenarios using these algor...
Some parts of the main branch are useless to a potential user: the Google Test library, unit tests, CI instructions, etc. Consider creating a separate, lightweight branch containing only parts essential...
It appears that the schema pointer can be removed from the Vertical class without loss of functionality. This would reduce memory cost, probably improve performance, and is overall an SE-reasonable fix....
As a CMake project, Desbordante is automatically indexed by CLion. This configuration, however, leads to an unfriendly search experience, as a bunch of files, namely gtest sources and datasets, pop up...
Some algorithms may take an extremely long time to process some datasets, so one test case can delay all the following ones (the user has to wait through the heavy test)....
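One possible way to keep a heavy case from stalling the rest is to give each case a time budget. The sketch below is a minimal illustration in standard C++ and is not taken from Desbordante's test code: `FinishedWithin` is a hypothetical helper name, and the budget value is an assumption the caller would choose.

```cpp
#include <chrono>
#include <future>
#include <thread>
#include <utility>

// Hypothetical helper (not part of Desbordante): runs `task` on a worker
// thread and reports whether it completed within `budget`.
template <typename Task>
bool FinishedWithin(Task task, std::chrono::milliseconds budget) {
    std::packaged_task<void()> job(std::move(task));
    std::future<void> done = job.get_future();
    // Detach the worker so that an over-budget task does not block the caller.
    // The task is NOT cancelled; it keeps running in the background.
    std::thread(std::move(job)).detach();
    return done.wait_for(budget) == std::future_status::ready;
}
```

A caller could wrap a long-running case as `FinishedWithin([] { /* run the algorithm */ }, std::chrono::seconds(30))` and report a failure when it returns `false`. Because the worker thread is only abandoned, not stopped, a real test setup would more likely rely on a per-test timeout in the test runner or on running each heavy case in a separate process.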
Desbordante could be built on Windows at commit `68d9cdd91f7b5719416bb3c82d300a5e3ce8da73`, but it has since lost this ability. It is time to investigate the issue and support Windows.