EconML icon indicating copy to clipboard operation
EconML copied to clipboard

DML discrete outcome

Open nicolavizioli opened this issue 1 year ago • 4 comments

Hi, I saw that now DML (I'm using Linear) gives the possibility to select "discrete outcome=True", this means than now residuals of outcome model are not more only {-1,0,1} and then I can use a classifier like XGBClassifier() for outcome model? thanks

nicolavizioli avatar May 03 '24 15:05 nicolavizioli

That's correct - if you set discrete_outcome=True then your y_model should be an sklearn classifier rather than a regressor and we'll use predict_proba rather than predict, so the residuals can take on any values in the range [-1,1].

kbattocchi avatar May 03 '24 17:05 kbattocchi

Hi @kbattocchi and others! The use of the response residuals y - p seems problematic to me for a couple of reasons (their discreteness being one of them). Is there any theoretical justification for there use in the LinearDML approach? Can't seem to find anything in the literature and documentation.

brandongreenwell-8451 avatar Dec 10 '24 20:12 brandongreenwell-8451

@brandongreenwell-8451 I don't think discreteness of the outcome should pose any particular problem - I believe the theory just requires that you can model E[Y|X,W], which is what using predict_proba with a classifier should give you (see https://econml.azurewebsites.net/spec/estimation/dml.html#overview-of-formal-methodology).

kbattocchi avatar Dec 11 '24 20:12 kbattocchi

Thanks @kbattocchi, I'll take another pass through the docs. Any idea how to interpret the result in this case?

brandongreenwell-8451 avatar Dec 18 '24 13:12 brandongreenwell-8451