quickwit Follow search performance on basic queries

Open fmassot opened this issue 4 years ago • 3 comments

We could use a restricted subset of the queries used in the tantivy benchmark.

It would also be interesting to measure performance with parallel queries.
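
As a rough illustration of the parallel case, here is a minimal Python sketch that keeps several queries in flight at once and reports latency and throughput. Everything in it is an assumption made for illustration: `run_query` stands for whatever actually sends a search request, `fake_run_query` merely sleeps, and the query strings are placeholders rather than the tantivy benchmark set.

```python
import time
from concurrent.futures import ThreadPoolExecutor
from statistics import median


def timed_call(run_query, query):
    """Return the wall-clock latency of a single query, in seconds."""
    start = time.perf_counter()
    run_query(query)
    return time.perf_counter() - start


def measure_parallel(run_query, queries, concurrency):
    """Run `queries` with up to `concurrency` requests in flight at once."""
    wall_start = time.perf_counter()
    with ThreadPoolExecutor(max_workers=concurrency) as pool:
        latencies = list(pool.map(lambda q: timed_call(run_query, q), queries))
    wall = time.perf_counter() - wall_start
    return {
        "concurrency": concurrency,
        "count": len(latencies),
        "median_s": median(latencies),
        "max_s": max(latencies),
        "throughput_qps": len(latencies) / wall,
    }


def fake_run_query(query):
    # Hypothetical stand-in for a real search request; it only sleeps.
    time.sleep(0.01)


if __name__ == "__main__":
    # Placeholder queries, not taken from any real benchmark.
    queries = ["hello", "hello AND world", "error"] * 4
    for concurrency in (1, 4):
        print(measure_parallel(fake_run_query, queries, concurrency))
```

Comparing the reported numbers across concurrency levels would show whether per-query latency degrades as load increases; threads are used here only because the placeholder workload is I/O-bound.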

Nov 17 '21 12:11 fmassot

Let's split this into subtasks:

- [x] Load test data
  - [x] Create indexes via CLI: https://github.com/PSeitz/qw_build_index
  - [x] Ingest via CLI
  - [ ] Store and retrieve datasets from S3
- [ ] Run a set of queries
  - [ ] Define the queries in a YAML or TOML file
  - [ ] Start the server via CLI (multiple configs, matrix?, memory layout randomization)
  - [ ] Warm up caches
  - [ ] Run queries
- [ ] Retrieve and store benchmark results in a DB (could be Quickwit :)
- [ ] Have a dedicated machine running the benchmarks continuously

Jan 13 '22 07:01 PSeitz

By the way, preparing a good benchmark set will help a lot with using PGO for Quickwit. Without some "generic" sample load, it is much more time-consuming to prepare a PGO-optimized binary.

May 24 '22 15:05 zamazan4ik

Interesting to see how databend is doing it: https://github.com/datafuselabs/databend/issues/3084

Jun 02 '22 11:06 fmassot

Regarding PGO (and BOLT), these links may be helpful:

ScyllaDB results: https://github.com/scylladb/scylladb/pull/10808
Vector results: https://github.com/vectordotdev/vector/issues/15631
Rust experience with LTO + PGO + BOLT: https://kobzol.github.io/rust/rustc/2022/10/27/speeding-rustc-without-changing-its-code.html

Dec 23 '22 00:12 zamazan4ik
