BTW, here is a version of github-actions-benchmark that aggregates results by running the benchmarks on Buildkite. In my experience, benchmarks run on GitHub's free runners give very noisy results.