grf icon indicating copy to clipboard operation
grf copied to clipboard

Questions regarding clusters for the first stage in the orthogonal causal forest

Open LuqianSun opened this issue 3 years ago • 3 comments
trafficstars

hi, in the R documentation, Y.hat can be predicted by boosted regression forest. Can it be used to predict W.hat?

Also, in the paper "Estimating Treatment Effects with Causal Forests: An Application", the authors mention we can use other predictive methods like boosting with cross-fitting, is there any examples for that?

LuqianSun avatar Jul 12 '22 12:07 LuqianSun

Yes, you can use any good prediction method to predict Y.hat and W.hat. If you know the treatment propensities you can just supply that instead.

erikcs avatar Jul 14 '22 18:07 erikcs

Thanks a lot, Eric! I meant to ask whether in the first stage when predicting Y_hat and W_hat if we should also set clusters=city_name? We have checked "Estimating Treatment Effects with Causal Forests: An Application" paper and grf GitHub menu and site but could not figure out what is the difference whether setting clusters or not in predicting Y-hat and W_hat.

What exactly does it mean to set clusters=school.id in Wager's "Estimating Treatment Effects with Causal Forests: An Application" paper's case in regression_forest?

Does this give different predictions compared with not setting clusters=school.id when predicting Y_hat and W_hat?

LuqianSun avatar Jul 17 '22 11:07 LuqianSun

Hi @LuqianSun, yes setting clusters for W.hat and Y.hat makes sense if you are setting them for causal forest. The online reference has a section on clustering here: https://grf-labs.github.io/grf/REFERENCE.html#cluster-robust-estimation

erikcs avatar Jul 17 '22 22:07 erikcs