patsy icon indicating copy to clipboard operation
patsy copied to clipboard

Add an over-parametrized dummy coding scheme

Open njsmith opened this issue 10 years ago • 3 comments

To provide one way for users who definitely want overparametrized dummy coding to just do

C(myfactor, DummyDammit)

or whatever.

[Request from Josef, talked over at PyCon]

njsmith avatar Apr 14 '15 21:04 njsmith

CC @josef-pkt

njsmith avatar Apr 14 '15 22:04 njsmith

We also need the opposite -- exclude all reference categories. This is needed for Cox proportional hazards models in survival analysis and some forms of multinomial regression. In those models the intercept is not identified, so we can't have 1 in the column-space of the design matrix.

kshedden avatar Apr 19 '15 21:04 kshedden

+1 for optional overparametrized designs

chrisgorgo avatar Oct 16 '15 22:10 chrisgorgo