Taylor G Smith
Taylor G Smith
I think this is a reasonable request, and certainly a common enough use case. @charlesdrotar let's spend some time discussing
Could you take this issue on, Charles? On Jan 30, 2017 6:42 PM, "charlesdrotar" wrote: > This is a super old issue but I wanted to bring this to your...
PEP3101 says `%` will be removed from string formatting at some point. Some of the discussion [here](http://stackoverflow.com/questions/13451989/pythons-many-ways-of-string-formatting-are-the-older-ones-going-to-be-deprec) seems to assert otherwise. We can add the `.format` replacement as a wishlist...
I like this idea. But we need to consider implications on CI testing if we go that route. We certainly won't bundle a Spark dist with our releases (as we...
Maybe I misunderstand, but for binning categorical, won't a LabelEncoder implicitly "shuffle" in the sense that it will be ordered by alpha internally? Maybe I'm missing the use case. On...
Hey there. The model summary is produced by the underlying statsmodels object. It appears to be using the [Goldfeld-Quandt](https://en.wikipedia.org/wiki/Goldfeld%E2%80%93Quandt_test) test. [1] https://github.com/statsmodels/statsmodels/blob/main/statsmodels/tsa/statespace/mlemodel.py#L4470 [2] https://github.com/statsmodels/statsmodels/blob/main/statsmodels/tsa/statespace/mlemodel.py#L3018
Sounds like there are several issues here, and the core desire is to keep the number of observations in the model from growing too large. You're correct that trimming internal...
Hey thanks for the well-written issue. We'll take a look at this
Let me look into what this would take. It doesn't seem like this would be too difficult
When you say `20 thousand timeseries` do you mean 20k samples? Can you please provide a bit more information, like how you're triggering these model fits on Spark executors, and...