hashbrown Making benchmarks better

The benchmarks should really be using criterion It would be interesting to benchmark it against bytell-hash-map which I did rust implementation for (Haven't optimised it that much and should really make it usable library).

Oct 30 '18 14:10 WaDelma

Also each benchmarked hashmap should be using the same hashing algorithm as otherwise you are comparing apples and oranges

Oct 30 '18 14:10 WaDelma

Current discussion on r/cpp about benchmark many hashmaps: https://www.reddit.com/r/cpp/comments/auwbmg/hashmap_benchmarks_what_should_i_add/

It may give ideas about new benchmarks and new hashmaps to consider.

Also, it's notable that Abseil's Swiss Table seem to struggle in benchmarks featuring insert/erase. In the benchmarks in the README we see hashbrown lagging 10% behind FxHashMap in grow_by_insertion, and otherwise always being faster, so it's unclear if hashbrown also suffers from this.

Feb 28 '19 18:02 matthieu-m

@matthieu-m Are the benchmark results available somewhere?

Feb 28 '19 19:02 Amanieu

Not that I know of, yet, short of running them yourself. At the moment the OP is gathering maps and benchmarks ideas, I am hoping that once everything is setup he'll publish some results.

Mar 01 '19 18:03 matthieu-m

For benchmarks you should replicate these. They are really good. They test:

LookupHit
LookupMiss
Insert
InsertErase (repeatedly insert and erase)
Iteration
Clear+InsertOrdered (ordered = same order as iteration order of the hashtable)
Clear+InsertUnordered (unordered = random order)

The tests are parametrized to run across the cross product of these dimensions:

payload sizes (4, 8, 16, 32, 64)
hot and cold tables (hot=in cache, cold=not in cache)
high and low density tables (high=max load factor, low=min load factor)

There is a good discussion in the forum about the design of these benchmarks and how they are useful. The most notable thing we found about benchmarks when implementing SwissTable is that benchmarks that predict production performance are very hard to write. In the end we wrote benchmarks to act as tools of understanding how a table behaves under certain loads and conditions: lookup/insert/miss, hot/cold, dense/sparse, etc. The only benchmarks we found useful for the hashtable as a whole was running large production workloads against them. In your case you have rustc-perf.

Jun 26 '19 22:06 alkis

hashbrown hashbrown copied to clipboard

Making benchmarks better

hashbrown
hashbrown copied to clipboard