desbordante-core
Desbordante is a high-performance data profiler capable of discovering many different patterns in data using various algorithms. It also allows running data cleaning scenarios using these algor...
WDCPlanets is a dataset with zero rows. It looks like CSVParser processes it incorrectly, which leads to different results from Pyro and TANE.
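To make the zero-row case concrete, here is a minimal sketch of a header-only CSV input. It uses Python's standard `csv` module, not Desbordante's CSVParser, and the function name `load_relation` and the column names are hypothetical. It only illustrates one reasonable expectation (columns are known, the row list is empty); it does not claim to reproduce the cause of the Pyro/TANE discrepancy.

```python
import csv
import io


def load_relation(text):
    """Parse CSV text into (column_names, rows); the first line is the header."""
    reader = csv.reader(io.StringIO(text))
    header = next(reader, None)
    if header is None:
        # Completely empty input: no columns and no rows.
        return [], []
    # Keep only non-empty records, so trailing blank lines do not become rows.
    rows = [row for row in reader if row]
    return header, rows


# Hypothetical header-only input standing in for a zero-row dataset.
columns, rows = load_relation("name,mass,radius\n")
```

With an input like this, a regression test could check that every algorithm receives the same schema and an empty tuple set before comparing their outputs.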
![image](https://user-images.githubusercontent.com/36251589/188654062-d417e8c0-2d72-4eb4-aebe-ad7efbe57e55.png)
Steps to reproduce:
1. Attempt to open the webapp

Chrome 104.0.5112.102
Currently, to create a TypoMiner object you must pass the precise algorithm parameter as algo. This looks strange, since I have to pull preciseAlgorithm out of the config, because this...
The Desbordante repository is about 100 MB, while the code itself is about 1.2 MB. This is due to heavy datasets: they have been removed from the repository but are still...
Desbordante compiles rather slowly. Find out the cause and fix it. Possible factors:
- code bloat due to inefficient header inclusion
- inclusion of Boost headers
For CI and usability reasons it is crucial to have step-by-step instructions, preferably in the form of a docker-compose YAML file. Implement Docker instructions and consider publishing a Docker image:...
The frontend needs information about the available parameters, e.g. maxLHS, error, seed. Therefore we need a method in the base class FDAlgorithm that would concurrently output info in a form similar...
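One possible shape for such a method is sketched below in Python for brevity; it is not the project's actual C++ interface. The output form is elided in the issue text, so a plain list of dictionaries is assumed here. `ParamInfo`, `describe_params`, `PARAMS`, and the subclass `ApproximateFD` are hypothetical names, and the type names and descriptions attached to maxLHS, error, and seed are illustrative guesses.

```python
from dataclasses import dataclass, asdict


@dataclass(frozen=True)
class ParamInfo:
    """Metadata the frontend could render for one parameter (assumed fields)."""
    name: str
    type_name: str
    description: str


class FDAlgorithm:
    # Each class lists only the parameters it introduces itself.
    PARAMS = ()

    @classmethod
    def describe_params(cls):
        """Collect parameter metadata from the base class down to cls."""
        collected = {}
        for klass in reversed(cls.__mro__):
            for param in vars(klass).get("PARAMS", ()):
                collected[param.name] = asdict(param)
        return list(collected.values())


class ApproximateFD(FDAlgorithm):
    # Hypothetical subclass; descriptions are assumptions, not project docs.
    PARAMS = (
        ParamInfo("maxLHS", "int", "assumed: maximum left-hand side size"),
        ParamInfo("error", "float", "assumed: error threshold"),
        ParamInfo("seed", "int", "assumed: random seed"),
    )


info = ApproximateFD.describe_params()
```

Declaring the metadata as class-level data keeps the description available without constructing an algorithm object, which matters if construction requires a loaded dataset.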