JuliaDB.jl
JuliaDB.jl copied to clipboard
Parallel analytical database in pure Julia
On the dataset defined [here](https://github.com/Rdatatable/data.table/wiki/Benchmarks-%3A-Grouping), a group operation is extremely slow on JuliaDB compared to DataFrames. The following benchmark was done on dataset of size N=1e8 from the link above....
I get the below error when trying to perform a join on distributed tables. Here's a MWE: ``` t1 = table((lk = [1,1,2,2,3], v1 = rand(5)), chunks = 2) t2...
As mentioned in #324 there are other issue that we also need to fix before tests pass.
OnlineStats does not export Weight. ```julia (@v1.4) pkg> add JuliaDB Updating registry at `~/.julia/registries/General` Updating git-repo `https://github.com/JuliaRegistries/General.git` Resolving package versions... Installed RecipesBase ─ v1.0.1 Installed Intervals ─── v1.1.0 Installed TimeZones...
Replaces #129 TODO: - [x] Wire up chunked reading to `loadtable` - [x] Split blocks across multiple workers - [x] Don't scale block size by file size - [x] Write...
I think it would be great if JuliaDB could save some form of metadata (maybe a dictionary) as a kind of header in the same file as the database. This...
The following code segfaults. Details: - tested on Julia v1.3.0 and 1.0.3 - the crash is related to the string column `zz`: no crash occurs in its absence - saving...
Hello, I am beginner in Julia. I try to import the big file of the french establishments (file opendata "sirene": http://files.data.gouv.fr/sirene/sirene_201711_L_M.zip). I used this code or derivatives codes Addprocs ()...
Here are the minimal steps that can be used to reproduce this issue on Windows - Execute following code and copy the path of variable `n` ``` using JuliaDB t...
Included a runnable MWE. The file is less than 1mb but just seems to hang in the terminal in Julia 1.2.0 Windows 10, but is working fine on Julia 1.1.1...