morfist icon indicating copy to clipboard operation
morfist copied to clipboard

What's the difference between morfist and the sklearn RandomForestRegressor with multi-output when dealing with only regression problems??

Open Qianqian-Yang opened this issue 2 years ago • 1 comments

When dealing with only regression problems, the results of sklearn RandomForestRegressor is different from that of morfist, what caused the difference? I'm not quite clear about the implementary details of multioutput sklearn RandomForestRegressor and morfist when conducting multioutput-regression.

Qianqian-Yang avatar Jun 03 '22 08:06 Qianqian-Yang

Specifically for regression problems, there's no substantial difference to scikit-learn's RandomForestRegressor that I'm aware of. Same goes for classification problems and scikit-learn's RandomForestClassifier.

What morfist does differently (and, I don't think there's currently support for this in scikit-learn) is offer random forest models with a mix of classification and regression tasks. By default, morfist treats output variables as regression variables, but you can pass a list of output-variable indices to the class_targets parameter of the constructor to force the training algorithm to evaluate select output variables as discrete/classification targets.

donlnz avatar Jun 10 '22 18:06 donlnz