reghdfe icon indicating copy to clipboard operation
reghdfe copied to clipboard

reghdfe postestimation influence

Open laszlosandor opened this issue 5 years ago • 2 comments

I was wondering how much of this could work with reghdfe: https://www.stata.com/features/overview/linear-regression-and-influence/

Currently (v 5.7.2) dfbeta surely does not.

FWIW, I was hoping to learn where my identification is coming from. Not just on the conceptual level but observation-by-observation.

laszlosandor avatar Nov 06 '19 13:11 laszlosandor

Dfbeta appears to require one regression per observation (kinda like leave one out regression). That's a bit insane from a computational perspective and for the usual dataset size of reghdfe, so I don't think it's practical.

There might be a few ways to speed up the leave-one-out computations, but they would involve a lot of code.

sergiocorreia avatar Nov 06 '19 13:11 sergiocorreia

Yeah, I'm not sure what I was hoping for. They surely have no other general-purpose solution other than looping through all observations and if them out one by one.

OK, other than that, is there a reasonable way to detect which observations influence reghdfe estimates most? If not for each coefficient separately, at least for overall leverage or something?

laszlosandor avatar Nov 06 '19 14:11 laszlosandor