
remove highly correlated categorical variables

Open zhaoliang0302 opened this issue 1 year ago • 2 comments

Hi,

step_corr() can remove highly correlated continuous variables using Pearson or Spearman correlation. However, the recipes package does not provide a comparable prefilter for categorical variables. I have 20 columns of categorical variables (one-hot encoded), and I want to remove the redundant columns that are correlated with each other. Can you give me some advice? Thanks
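
To make the situation concrete, here is a minimal sketch in base R. The data, the column names (`cat_a`, `cat_b`, `cat_c`) and the 0.9 cutoff are all made up for illustration and are not from a real data set. It relies on the fact that one-hot columns are numeric 0/1, so Pearson correlation between two of them is the phi coefficient; whether that is the right measure of redundancy for a given problem is an open question.

```r
# Hypothetical one-hot data: names and values are invented for illustration.
set.seed(1)
a <- rbinom(200, 1, 0.5)
onehot <- data.frame(
  cat_a = a,
  cat_b = a,                     # exact copy of cat_a, i.e. fully redundant
  cat_c = rbinom(200, 1, 0.5)    # drawn independently of cat_a
)

# Pearson correlation on 0/1 columns is the phi coefficient.
phi <- cor(onehot, method = "pearson")

# Flag each pair (upper triangle only) whose absolute correlation
# exceeds an assumed cutoff.
cutoff <- 0.9
high <- which(abs(phi) > cutoff & upper.tri(phi), arr.ind = TRUE)
redundant_pairs <- data.frame(
  col1 = rownames(phi)[high[, "row"]],
  col2 = colnames(phi)[high[, "col"]]
)
print(redundant_pairs)
```

Dropping one column from each flagged pair would be the manual equivalent of what step_corr() does for continuous predictors.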

Best regards

zhaoliang0302 • Oct 11 '22 03:10