Erik Sverdrup comments

Results 119 comments of


                                            Erik Sverdrup

trafficstars

ATE over subsets with low and high estimated CATEs - nonsensical results

I'm not sure about reference, but here is a simple example illustrating the issue with an OOB mean: Let $Y_i \sim Bernoulli(\mu)$, $i=1...n$, with mean $\mu=0.5$. Then $\mu^{(-1)} = \mu...

Parameter tuning for RDD / lm_forest

Hi @corydeburd, That MSE is a reasonable object to consider. Another approach is to treat RDD Forest purely as a data-driven algorithm to find heterogeneous subgroups, i.e: Split data into...

Parameter tuning for RDD / lm_forest

Hi @yusukematsuyama, there's no fixed rule for the train/test, 50/50 and 70/30 are just some common choices. Forests are usually robust wrt tuning parameters, it's hard to say which range...

Assessing treatment heterogeneity in instrumental_forest

Hi @minnnjecho, 1. The TOC/RATE can "subsume" the test_calibration exercise, so no plan for future IV support. 2. A significant held-out RATE suggests the tuned forest was able to detect...

RATE with low treatment propensities --- target.sample="treated"?

Hi @robert702, that's an interesting question. Since the AUTOC can be represented as a weighted ATE ((8) in https://arxiv.org/pdf/2111.07966.pdf) I wonder if RATE + [Crump et al. (2009)'s](https://academic.oup.com/biomet/article-abstract/96/1/187/235329?redirectedFrom=fulltext) subsetting via...

RATE with low treatment propensities --- target.sample="treated"?

My immediate reaction would be to just do what's posted above, that's one of the reasons I added the `subset` argument to the rate function.

Including variable labels in the policy tree visualization

Hi @njawadekar, you can rename `tree$columns` to the names you prefer, then plot.

ATE+Clustering

If you are in a setting with binary W you can have a look at the first expression for Gamma https://grf-labs.github.io/grf/articles/muhats.html to understand what the code above is doing. If...

Access to value of criterion for splits

Hi @n-ihlo, grf doesn't store the value of the criterion it uses to determine splits. Causal forest does CART splits on an appropriate "pseudo-outcome". All the information needed to reconstruct...

Questions Regarding Causal Forest Variable Types and "test_calibration" Interpretation

Hi @Matt9617, 1. Yes. 2. That's correct. 3. Stefan has a nice video lecture [here](https://youtu.be/fAUmCRgpP6g?list=PLxq_lXOUlvQAoWZEqhRqHNezS30lI49G-&t=1241) on interpreting that kind of calibration exercise. You might find this recent grf feature, [RATE](https://grf-labs.github.io/grf/articles/rate.html),...