DataFrames.jl
DataFrames.jl copied to clipboard
In-memory tabular data in Julia
I have a data frame that contains multiple groups of columns and I want to stack them up. If there were only one group, then the regular `stack` function would...
I feel like the question about datframes with distributed arrays comes up a lot. My impression is that we don't know, for sure, if a Dagger array etc. can "just...
A separate issue to keep track only this feature
It would have to be carefully thought about as `sort` has a complex API, but we could allow things like: ``` sort(df, :x => x -> x^2) ``` (so to...
It would be nice if there was a function to sort rows by some function of an entire `DataFrameRow` instead of by individual column values. An example implementation would be:...
I was bit by this issue in a case that looked like this: ```julia julia> df = DataFrame(x=1:3); julia> select(df, :x => (x->2x), :x => (x->3x)) ERROR: ArgumentError: duplicate output...
Would be nice to have `hcat!` exposed. I think its useful, especially since the only reason to use `hcat` is for performance in the first place.
Currently a single row is still printed as a "DataFrame" - i.e. a 1-row table, with truncated columns. I wonder if it would be more natural to show a single...
Currently, `missing` is treated as the largest value in a sort. pandas has a `na_position` kwarg that lets you specify how missing data should be ordered, by default placing it...