hyppo icon indicating copy to clipboard operation
hyppo copied to clipboard

A nonparametric test based on regression error (FIT)

Open MatthewZhao26 opened this issue 3 years ago • 1 comments

A nonparametric test based on regression error (FIT) [paper] [python code]

  • A bit more fringe than KCI/KCIP but provides good simulation comparisons between all three methods plus more.
  • Uses a nonparametric regression (in their case, a decision tree) to examine the change in predictive power based on including versus excluding some variables Z.
  • Uses the mean squared error as a test statistic and an analytic Gaussian/T-test approach to compute a pvalue
  • Seemingly efficient for large samples sizes as compared to other kernel based approaches.
  • Interesting connections in that trees/forests are adaptive kernel methods and extensions to forests/honesty/leaf permutations.

[Issue 226]

MatthewZhao26 avatar Feb 09 '22 04:02 MatthewZhao26

In the paper they use a decision tree to compute the mean squared error, but I don't think there is anything stopping any other method. Thus the implementation may want to input a general sklearn style regression function.

rflperry avatar Feb 09 '22 07:02 rflperry