Bogumił Kamiński

Results 185 issues of Bogumił Kamiński

In https://github.com/JuliaData/DataFrames.jl/pull/3096#issuecomment-1175284117 there is a question on DataFrames.jl+Pluto.jl integration. The issue is: > Pluto.jl does some customization if the `class` in `div` and `table` is `data-frame`. Some of those customization...

other packages
display & PlutoRunner

I think it would greatly improve the user experience of MLJ.jl if https://alan-turing-institute.github.io/DataScienceTutorials.jl/ were better exposed. Now the link is at the bottom of the page, and none of my...

While I am implementing https://github.com/JuliaData/DataFrames.jl/pull/3055 let us discuss if we want some DataFrames.jl specific namespace of metadata. Following Apache Arrow, which reserves `ARROW:` prefix for keys, we could say that...

decision
display

This PR waits for https://github.com/JuliaData/DataAPI.jl/pull/48. I have done an initial implementation. Now we need to discuss for which methods metadata propagation should happen. For now I have implemented it for...

feature
metadata

This issue is meant to gather requirements for all `join*` operations regarding the resulting order of rows after the join operation. After we finish the discussion I will implement it...

priority
feature
joins

There are two natural cases in `describe` that are currently hard: 1. get the number of rows; 2. get the number of non-missing rows. @nalimilan - would you have any...

feature

We can use `findall` instead of current approach which will be faster with e.g. https://github.com/andyferris/AcceleratedArrays.jl

performance

The implementation would be: ``` maximum(df::AbstractDataFrame, col::ColumnIndex) = df[argmax(df.col), :] minimum(df::AbstractDataFrame, col::ColumnIndex) = df[argmin(df.col), :] maximum(gdf::GroupedDataFrame, col::ColumnIndex) = combine(gdf, sdf -> maximum(sdf, col)) minimum(gdf::GroupedDataFrame, col::ColumnIndex) = combine(gdf, sdf -> maximum(sdf,...

feature

The infrastructure is ready for this. We just need to decide when it is worth to do it, make, and document changes.

performance