recipeselectors icon indicating copy to clipboard operation
recipeselectors copied to clipboard

Recipe step for dealing with highly correlated values for ensemble stacked models

Open kamaulindhardt opened this issue 3 years ago • 0 comments

Dear Steven Pawley & Max Kuhn, and other enthusiasts,

Regarding my request for help on RStudio Community https://community.rstudio.com/t/does-themis-package-feature-functions-for-dealing-with-continuous-data-imbalance/110432

I am in need of a solution to solve my issue of my poor performing ensemble stacked model, that I suspect to be related to some kind of feature pre-processing steps. Your package recipeselectors seems promessing.

I already perform these preprocessing steps

 step_impute_mode(Product) %>% 
  step_novel(Site_Type, Tree, -all_outcomes()) %>% 
  step_dummy(Site_Type, Tree, one_hot = TRUE, naming = partial(dummy_names,sep = "_")) %>% 
  step_zv(all_numeric(), -all_outcomes()) %>%
  step_corr(all_numeric(), -all_outcomes()) %>% 
  step_lincomb(all_numeric(), -all_outcomes()) %>% 
  step_normalize(all_numeric(), -all_outcomes()) %>%
  step_impute_mode(all_nominal(), -all_outcomes()) %>%
  step_impute_knn(logRR) 

As you see on the model evaluation graphs something goes wrong in the modelling. For some reason, my model performs exceptionally poor, especially around the centre.

Here is a snapshot of my ensemble stacked model output.

image

image

kamaulindhardt avatar Jul 20 '21 09:07 kamaulindhardt