Bogumił Kamiński
Bogumił Kamiński
I have not added the tests yet. Let us first decide if we like the design of what I propose and then I will finalize the PR. The proposed change...
Currently we have: ``` julia> x = [1:3, 4:6, 7:9] 3-element Array{UnitRange{Int64},1}: 1:3 4:6 7:9 julia> var(x) 3-element Array{Float64,1}: 9.0 9.0 9.0 julia> var(skipmissing(x)) ERROR: MethodError: no method matching realXcY(::StepRangeLen{Float64,Base.TwicePrecision{Float64},Base.TwicePrecision{Float64}},...
Currently `quantile` does not support `AbstractArrays` except for `AbstractVectors`. The reason is that we use `collect` for a general iterable, which retains its dimensionality. You can write ``` quantile(vec([1 2;...
Related https://julialang.zulipchat.com/#narrow/stream/274208-helpdesk-.28published.29/topic/vcat.20dataframes.20with.20pooledarrays Currently we use `Tables.allocatecolumn` as it is safe. E.g. currently we have an issue with PooledArrays.jl https://github.com/JuliaData/PooledArrays.jl/issues/88 which current design handles correctly. Probably it would be better to...
In https://github.com/bkamins/Julia-DataFrames-Tutorial/blob/master/04_loadsave.ipynb I had to disable CSVFiles.jl file reading tests as it failed to load a small file (that reads in a few seconds otherwise) in any reasonable time. The...
Consider the following code: ``` julia> df = DataFrame(x = [',','\n', ',']) 3×1 DataFrame │ Row │ x │ │ │ Char │ ├─────┼──────┤ │ 1 │ ',' │ │...
Because the default `nastring` is `NA` there is a following problem: 1. take a data structure that has e.g. `String` column with missing data in it; 2. `save` it to...
What I think should be done with tests: 1. separate logging of correctness tests (that can go to console) and performance tests (that I think we should use separate logger...
Add sections: 1. Contributors 2. Usage examples 3. Corrected benchmark plots (or something else that clearly shows the benchmark results). Here ideally we should do benchmarks on different machine sizes...