patsy
patsy copied to clipboard
Describing statistical models in Python using symbolic formulas
As discussed in https://github.com/pydata/patsy/issues/10, the dot/period on the RHS of a formula is not currently supported in Patsy. Could you update the docs, in the Comparison to R section https://patsy.readthedocs.io/en/latest/R-comparison.html,...
There's a [request here](https://stackoverflow.com/questions/34035912/patsy-new-levels-in-categorical-fields-in-test-data/45094458?noredirect=1#comment77176675_45094458) to add an option so that when an unrecognized categorical level is encountered, it should be encoded as all-zeros, which is apparently similar to what [scikit-learn's...
For example I have some variables "x1, x2, x3, ...", and want all polynomials with a simple choice on the degree. The only solution I know so far is by...
We've recently ran into a pretty silly problem with terms naming when using the `Treatment` scheme, see below: Imports and data prep.: ``` import numpy as np from patsy import...
I have a similar issue. When doing this I get a blank assertion error as well ``` `outcome_1, predictors_1 = patsy.dmatrices("Q('%received 18+') ~ n_killed_capita_2016", aggregate_2016_df) mod_1 = sm.OLS(outcome_1, predictors_1) res_1...
Totally inexperienced user. My first Negative Binomial Regression. iPython on Google's Colab. I load the dataset as a pandas df. The features (and Target) in the formula below all appear...
I have written a function called `AbsorbingLS` that can [absorb a large number (millions) of categorical variables or categorical interactions](https://bashtage.github.io/linearmodels/devel/iv/absorbing.html). It is implemented using a Frisch-Waugh-Lovell step where the categoricals...
patsy: '0.5.1' https://github.com/statsmodels/statsmodels/issues/5552 > `SM: 0.9.0` > For categorical endog variable in logistic regression, I still have to gerneate a dummay variable for it like the following. > > ```python...
This `print` breaks the doc generation in python3 and with ipython-sphinx https://github.com/pydata/patsy/blob/4c613d0ad3009044ca3aee5a5d70bd56af8f396b/doc/spline-regression.rst#L182
AFAICS, patsy sets all exterior knots in `BS` at the same points, i.e. lower_bound and upper_bound. https://github.com/pydata/patsy/blob/master/patsy/splines.py#L229 I'm trying to replicate some mgcv functions, and mgcv chooses by default spread...