py icon indicating copy to clipboard operation
py copied to clipboard

Missing value

Open SK-ASIF-ALI opened this issue 5 years ago • 1 comments

In the age column, there are 177 Nan. How to deal with whether should I delete them or put the mean of age column??

SK-ASIF-ALI avatar Dec 06 '20 18:12 SK-ASIF-ALI

drop the feature or fill missing value ? If the number of NaN is great then you may consider to drop the feature otherwise fill the missing values with mean or median.

mean or median ? If there is outliers in the features consider median to fill NaN as outliers affect the mean values.

How to find outliers ? If the skew of feature is right or left then features may have outliers.

btirth avatar Feb 13 '21 16:02 btirth