Matthieu Gomez

Results 174 comments of Matthieu Gomez
trafficstars

Btw I think the slowdown comes from missing_omit that creates a new namedtuple type depending on variables in the formula.

I think it’s still about specialization — it’s just that everything after missing_omit is respecialized to the new dataset. Yes I think the way forward would be to write missing_omit...

cf https://github.com/JuliaData/TableOperations.jl/issues/7

Special casing would be great. One tiny drawback is that `categorical` is a bit verbose for something that common (in Stata, one can simply write `i.x`), but that's a really...

No it does not (if I understand your question correctly). For now, using `categorical` in the formula fails because the package tries to apply the function elementwise. ```julia using DataFrames,...

Still, the issue is what happens when the variable is `Vector{Union{

Note that for now ```julia using StatsModels N = 10_000_000 x = rand(N) + rand([0, missing], N) df = (x = x,) schema(Term(:x), df) #julia(20271,0x116991dc0) malloc: can't allocate region #:***...

yes, just connecting it to https://github.com/FixedEffects/FixedEffectModels.jl/issues/99

I agree with this. A related suggestion is that the RHS of the formula could always return a tuple of terms no matter the number of terms. The current situation...

I also think the current situation is confusing and I would rather have any of the solution mentioned in the thread.