Add chDB to the DuckDB benchmark(former h2o)
Add chDB to the benchmark(former h2o) https://duckdblabs.github.io/db-benchmark/
I've started working on it. Please advice which approach is better to use.
There are several options for CSV query implementations
- direct chdb.query
- create connection and then conn.query
- create a session
- use dbapi connection
I'm currently thinking about session
In current version. Connection based API (No.2) is the fastest
Hi @auxten @wudidapaopao I've prepared drafts calculations for the benchmark.
Test were run on local machine - macOS 15.5, M1 Pro (32 Gb RAM) Some queries are significantly slower than DuckDB
I've used similar table structures and queries as they were used in ClickHouse solution. To add results to the official benchmark, they are asking to run them also on large AWS (quote below) and will be great to be able to add results to official repo as well, but I will need some help from your team
The benchmark will now be updated upon request. A request can be made by creating a PR with a combination of the following.
The PR must include
updates to the time.csv and log.csv files of a run on a c6id.metal machine. If you are re-enabling a query for a solution, you can just include new times and logs for the query, however, the version must match currently reported version.
It would be awesome if you will be able to review the code related to the chdb in my fork:
https://github.com/cyrusmsk/db-benchmark/pull/1
Keep in mind that currently 2 separate branches used for chdb.session and chdb.connect approaches (but the difference is only in the logic of conn object creation - 2 lines of code)
Join comparison
Group-by comparison
PR is here https://github.com/duckdblabs/db-benchmark/pull/131
@auxten @wudidapaopao the results were added! https://duckdblabs.github.io/db-benchmark/ The author of the repo tested on 3.6.0 though.. maybe on 3.7.0 results will be a bit better. Also some issues were observed when it tried to use 50GB file.
But in general this task could be closed I think now