scikit-criteria
scikit-criteria copied to clipboard
Data cleaning
Data cleaning
- It is understandable that some data sets have invalid values.
- The most common invalid values are
NaN
,inf
, and-inf
but there may be cases where some finite values are invalid (debts < 0 for example). - It would be useful to have some kind of functionality that allows removing these invalid values.
- values or replace them with some useful value with some heuristic.
Useful links
- https://scikit-learn.org/stable/modules/impute.html
- https://scikit-learn.org/stable/modules/classes.html#module-sklearn.impute