big_data_benchmarks Only aggregations

Only aggregations

Open maartenbreddels opened this issue 5 years ago • 1 comments

This is a somewhat better comparison, and it makes dask run. Instead of materializing a column, we now aggregate (take the mean), and we don't ask dask to materialize the filtered dataframe. These are my results using vaex-hdf5 (parquet is much slower):

And vaex:

Jan 23 '20 19:01 maartenbreddels

With parquet it is slightly faster, but I cannot run the filtered part.

Jan 23 '20 20:01 maartenbreddels
