patsy
patsy copied to clipboard
Describing statistical models in Python using symbolic formulas
Not sure what's going on here. ``` In [44]: patsy.dmatrix("C(a)", {'a':['m', 'n', 'o']}) Out[44]: --------------------------------------------------------------------------- TypeError Traceback (most recent call last) in () ----> 1 dmatrix("C(a)", {'a':['m', 'n', 'o']}) /Users/mike/venv/sci/lib/python2.7/site-packages/patsy/highlevel.pyc...
So one of my datasets can be super-huge and therefore, super-slow by my impatient standards. I can sit there for 3-5 minutes just waiting for a design matrix to be...
This is a suggestion to implement backticks as an alias for quoting `Q('...')`. E.g.: ``` Q('x/y') == `x/y` ``` Rationale: 1. Traditional: `R` syntax allows addressing fields as: ``` data$`x/y`...
Not sure if this is out of scope for the project, but this PR tries to smooth over the only issue I've came across so far when using Patsy with...
Hello, I fit some (relatively) large-ish GLMs in statsmodels and have been experimenting with using `patsy` instead of a home rolled thing. My home rolled method isn't very good (I...
Patsy automatically remove redundant columns (linearly dependent) so that the final matrix is not overdetermined. is there an option to turn off the removal? I would like to use patsy...
Given that patsy is dropping support for Python 2.6 in the next release, it might be a good time to drop support for other old pieces of software, like Pandas...
I am dealing with a situation where each item can have 1 or 2 labels both of which come from the same set. In this case I only want to...
It would be nice to have a function, say "N( )", that indicates that input data with pandas dtype "category" should be interpreted as numerical data. In that sense this...
In edge cases where a variable has zero variance, standardize should do the same as center rather than producing an error. Suppose I am fitting a linear model with a...