Bogumił Kamiński

Results 185 issues of Bogumił Kamiński

This issue is meant to replace: https://github.com/JuliaData/DataFrames.jl/issues/2215 https://github.com/JuliaData/DataFrames.jl/issues/2148 https://github.com/JuliaData/DataFrames.jl/issues/3066 https://github.com/JuliaData/DataFrames.jl/issues/2422 https://github.com/JuliaData/DataFrames.jl/issues/2414 https://github.com/JuliaData/DataFrames.jl/issues/1839 The proposed improved API for `stack` is: ``` stack(df::AbstractDataFrame, measure_vars, id_vars; variable_name=:variable, name_value=identity, value_name=:value, # or function variable_eltype::Type=String,...

feature
reshaping

See https://github.com/JuliaData/DataFrames.jl/pull/1709#discussion_r253771079. This should be changed when https://github.com/JuliaLang/julia/pull/30485 is fixed.

performance
non-breaking

What do you think about adding the following function: ``` findcols(predicate, df::AbstractDataFrame) = findall(predicate, eachcol(df)) ``` Maybe such function would be more readable than using `predicate.(eachcol(df))` for column selection?

feature

x-ref https://github.com/JuliaLang/julia/issues/40626, https://github.com/JuliaData/CSV.jl/pull/1046, https://github.com/JuliaData/CSV.jl/issues/1045

multithreading

@ronisbr for example the following fails: ``` show(stdout, MIME("text/html"), df, allcols=true) ``` although users can expect that the following kwargs would work: ``` allrows::Bool = !get(io, :limit, false), allcols::Bool =...

bug
display

We have most problematic performance in this case.

grouping

I thought it is not needed, but now I think it is 😄.

reshaping

We currently have a limited support for multi-threading in joins. It would be good to improve it in the future.

performance
joins

x-ref https://github.com/JuliaLang/julia/pull/47695

ecosystem

x-ref https://github.com/JuliaData/DataFrames.jl/issues/3411