Add ccp_alpha for pruning to Tree based estimators

Open • ParthivNaresh opened this issue 3 years ago • 1 comment

As of sklearn version 0.22, ccp_alpha has been added as a pruning parameter for Decision Trees, Extra Trees, and Random Forests.

Adding this as a hyperparameter would give AutoML an additional parameter to search over and could help prevent overfitting, a common problem when trees grow too large.

https://scikit-learn.org/stable/modules/tree.html#minimal-cost-complexity-pruning
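
For reference, here is a minimal sketch of what `ccp_alpha` does in plain scikit-learn (0.22 or later), outside of evalml. The dataset, the train/test split, and the value `0.01` are arbitrary choices for illustration, not proposed defaults or a proposed search range.

```python
from sklearn.datasets import load_breast_cancer
from sklearn.model_selection import train_test_split
from sklearn.tree import DecisionTreeClassifier

# Illustrative dataset only; any classification data would work here.
X, y = load_breast_cancer(return_X_y=True)
X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=0)

# ccp_alpha defaults to 0.0, which means no pruning is performed.
unpruned = DecisionTreeClassifier(random_state=0).fit(X_train, y_train)

# The pruning path lists the effective alphas at which subtrees get pruned.
# These could serve as candidate values for a hyperparameter search.
path = unpruned.cost_complexity_pruning_path(X_train, y_train)
ccp_alphas = path.ccp_alphas

# Arbitrary alpha chosen for illustration; larger values prune more nodes.
pruned = DecisionTreeClassifier(random_state=0, ccp_alpha=0.01).fit(X_train, y_train)

print("nodes (unpruned):", unpruned.tree_.node_count)
print("nodes (pruned):  ", pruned.tree_.node_count)
print("test accuracy (unpruned):", unpruned.score(X_test, y_test))
print("test accuracy (pruned):  ", pruned.score(X_test, y_test))
```

Whether the pruned tree actually scores better depends on the data, so any range we expose to AutoML would need to be validated on our own benchmarks.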

This could be broken up into 3 issues, one for each estimator class.

ParthivNaresh • Mar 22 '21 17:03

> This could be broken up into 3 issues, one for each estimator class.

Let's start with one of our tree-based estimators and demonstrate a performance improvement. Then we can file issues and get to the others.

dsherry • Mar 25 '21 15:03