DataFrames.jl icon indicating copy to clipboard operation
DataFrames.jl copied to clipboard

In-memory tabular data in Julia

Results 170 DataFrames.jl issues
Sort by recently updated
recently updated
newest added

I recently encountered a situation where I needed to compute the minimum value of a variable grouped by another variable while ignoring missing values. In Stata, this can be achieved...

decision

@nalimilan, @ronisbr, @pdeffebach - what would you say if we stored display settings for DataFrames.jl in [Preferences.jl](https://github.com/JuliaPackaging/Preferences.jl)? Now we use `ENV`, but it is a bit clumsy and not persistent....

display

Fixes https://github.com/JuliaData/DataFrames.jl/issues/3005 https://github.com/JuliaData/DataFrames.jl/issues/2890 https://github.com/JuliaData/DataFrames.jl/issues/3116 https://github.com/JuliaData/DataFrames.jl/issues/2767 The PR adds `nest` and `unnest` and introduces `scalar` kwarg to `flatten` (which is needed in `unnest`. `flatten` is ready for review. For `nest` and...

feature

It would be great if DataFrames.jl had a function or functions that would function more or less the same way `subset` does, except that they'd would return a vector containing...

feature

This issue is meant to replace: https://github.com/JuliaData/DataFrames.jl/issues/2215 https://github.com/JuliaData/DataFrames.jl/issues/2148 https://github.com/JuliaData/DataFrames.jl/issues/3066 https://github.com/JuliaData/DataFrames.jl/issues/2422 https://github.com/JuliaData/DataFrames.jl/issues/2414 https://github.com/JuliaData/DataFrames.jl/issues/1839 The proposed improved API for `stack` is: ``` stack(df::AbstractDataFrame, measure_vars, id_vars; variable_name=:variable, name_value=identity, value_name=:value, # or function variable_eltype::Type=String,...

feature
reshaping

See https://github.com/JuliaData/DataFrames.jl/pull/1709#discussion_r253771079. This should be changed when https://github.com/JuliaLang/julia/pull/30485 is fixed.

performance
non-breaking

What do you think about adding the following function: ``` findcols(predicate, df::AbstractDataFrame) = findall(predicate, eachcol(df)) ``` Maybe such function would be more readable than using `predicate.(eachcol(df))` for column selection?

feature

x-ref https://github.com/JuliaLang/julia/issues/40626, https://github.com/JuliaData/CSV.jl/pull/1046, https://github.com/JuliaData/CSV.jl/issues/1045

multithreading

@ronisbr for example the following fails: ``` show(stdout, MIME("text/html"), df, allcols=true) ``` although users can expect that the following kwargs would work: ``` allrows::Bool = !get(io, :limit, false), allcols::Bool =...

bug
display

In TSFrames.jl the join function looks like: ```julia function Base.join( ts1::TSFrame, ts2::TSFrame, ts...; jointype::Symbol=:JoinAll ) result = joinmap[jointype](ts1.coredata, ts2.coredata, on=:Index, makeunique=true) for tsf in ts result = joinmap[jointype](result, tsf.coredata, on=:Index,...

joins