scikit-criteria icon indicating copy to clipboard operation
scikit-criteria copied to clipboard

Data cleaning

Open leliel12 opened this issue 2 years ago • 1 comments

Data cleaning

  • It is understandable that some data sets have invalid values.
  • The most common invalid values are NaN, inf, and -inf but there may be cases where some finite values are invalid (debts < 0 for example).
  • It would be useful to have some kind of functionality that allows removing these invalid values.
  • values or replace them with some useful value with some heuristic.

leliel12 avatar Feb 21 '22 22:02 leliel12

Useful links

  • https://scikit-learn.org/stable/modules/impute.html
  • https://scikit-learn.org/stable/modules/classes.html#module-sklearn.impute

leliel12 avatar Feb 22 '22 13:02 leliel12