Bogumił Kamiński
@nalimilan - OK. I did both.
Thank you!
Fixed in https://github.com/JuliaData/DataFrames.jl/pull/3457
Given the discussion - I think adding an argument is the most flexible option.
Ah - now I understand the issue. I understand that you want to predefine a function that you would pass to `labels` and it would provide a different way of...
And what if, assuming we do not touch the cut-points, we provide a modification of labels that chooses the shortest label that does not change the split? In other words...
If we use the R-1 quantiles then indeed we have `y`. Then we need to do one pass over data to collect for each quantile the greatest value lower than...
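A minimal sketch of the single pass described above, covering only the part that is stated (for each cut point, the greatest data value strictly below it). The function name `greatest_below` and its arguments are hypothetical and are not part of any package; this is an illustration, not the implementation discussed in the thread.

```julia
# Hypothetical helper: for each cut point, find the greatest data value
# strictly below it, using one pass over the data.
function greatest_below(x::AbstractVector{<:Real}, cuts::AbstractVector{<:Real})
    # `nothing` marks cut points that have no data value below them
    best = Union{Nothing,eltype(x)}[nothing for _ in cuts]
    for v in x                      # single pass over the data
        for (i, c) in pairs(cuts)   # update every cut point that v lies below
            if v < c && (best[i] === nothing || v > best[i])
                best[i] = v
            end
        end
    end
    return best
end

# Example call with made-up data and cut points
greatest_below([1.0, 2.5, 4.0, 7.0], [2.0, 5.0])
```

The inner loop visits every cut point for each value, so the cost is proportional to the number of values times the number of cut points; with sorted cut points a binary search could reduce that, at the price of a slightly longer sketch.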
Also:
```
julia> @formula(price ~ bdrms + identity(sqrft - 150.0 * bdrms))
┌ Warning: Number 150.0 removed from interaction term 150.0 & bdrms
└ @ StatsModels C:\Users\bogum\.julia\packages\StatsModels\fK0P3\src\formula.jl:158
FormulaTerm
Response:
  price(unknown)...
```
@nalimilan - this issue should be migrated to Statistics.jl but I do not have privileges to do so. Could you please do it? Thank you!
I have never built a system image. Are you trying to do `groupby` on a very wide table with lots of `by`?