redflag icon indicating copy to clipboard operation
redflag copied to clipboard

Transform non-Gaussian features before outlier detection

Open kwinkunks opened this issue 2 years ago • 1 comments

Can't use (say) +/- 3 standard deviations if feature is non-Gaussian. So apply transformation first, eg with Yeo-Johnson transformation, see https://scikit-learn.org/stable/modules/generated/sklearn.preprocessing.PowerTransformer.html and also #46

kwinkunks avatar Aug 16 '23 14:08 kwinkunks

Also see "shifting transformation", eg https://arxiv.org/abs/2106.03899

kwinkunks avatar Aug 16 '23 14:08 kwinkunks