Matthieu Gomez comments

Results 174 comments of


                                            Matthieu Gomez

trafficstars

`fit` is very slow for new formulas

Btw I think the slowdown comes from missing_omit that creates a new namedtuple type depending on variables in the formula.

`fit` is very slow for new formulas

I think it’s still about specialization — it’s just that everything after missing_omit is respecialized to the new dataset. Yes I think the way forward would be to write missing_omit...

`fit` is very slow for new formulas

cf https://github.com/JuliaData/TableOperations.jl/issues/7

Specify certain variables as categorical

Special casing would be great. One tiny drawback is that `categorical` is a bit verbose for something that common (in Stata, one can simply write `i.x`), but that's a really...

Specify certain variables as categorical

No it does not (if I understand your question correctly). For now, using `categorical` in the formula fails because the package tries to apply the function elementwise. ```julia using DataFrames,...

Incorrect Default Coding of Union{Missing, Int or Float64}

Still, the issue is what happens when the variable is `Vector{Union{

Incorrect Default Coding of Union{Missing, Int or Float64}

Note that for now ```julia using StatsModels N = 10_000_000 x = rand(N) + rand([0, missing], N) df = (x = x,) schema(Term(:x), df) #julia(20271,0x116991dc0) malloc: can't allocate region #:***...

Incorrect Default Coding of Union{Missing, Int or Float64}

yes, just connecting it to https://github.com/FixedEffects/FixedEffectModels.jl/issues/99

coefnames should always return a tuple no mattter the number of terms

I agree with this. A related suggestion is that the RHS of the formula could always return a tuple of terms no matter the number of terms. The current situation...

Use some character other than * for "main effects and interactions"

I also think the current situation is confusing and I would rather have any of the solution mentioned in the thread.