Feature request: ability to measure benchmark deltas across commits/branches
Use case: Locker Service would like to measure the performance of the browser's native eval() against Locker's secureEval(). We can write Best benchmarks to measure each one independently, but AFAICT there's no way to measure the delta between the two and track that delta across commits.
For instance, if eval() itself regresses between Chrome 71 and Chrome 72, we don't care. But if the delta between our secureEval() and eval() regresses, we do care. We can track the two independently and infer this ourselves, but it would be better if Best had built-in support for this use case.
Other comparisons that could use the same feature: LWC vs Aura, polyfill vs native, etc.
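
To make the request more concrete, here is a rough sketch of the comparison we currently have to do by hand. Everything in it is hypothetical: `CommitSample`, `DeltaReport`, `compareDeltas`, and the 5% threshold are illustrative names and values, not part of Best's API, and the numbers are made up. It also assumes the "delta" is expressed as a ratio (candidate time / baseline time); an absolute difference would be an equally reasonable choice.

```typescript
// Hypothetical shapes for illustration only; not part of Best's API.
interface CommitSample {
  commit: string;
  baselineMs: number;  // e.g. median duration of the native eval() benchmark
  candidateMs: number; // e.g. median duration of the secureEval() benchmark
}

interface DeltaReport {
  commit: string;
  overheadRatio: number;           // candidateMs / baselineMs
  changeVsPrevious: number | null; // relative change in the ratio vs. the previous commit
  regressed: boolean;              // true when the ratio grew by more than the threshold
}

// Flags a commit only when the candidate got slower *relative to* the baseline,
// so a slowdown that affects both benchmarks equally is not reported.
function compareDeltas(samples: CommitSample[], threshold: number = 0.05): DeltaReport[] {
  const reports: DeltaReport[] = [];
  let previousRatio: number | null = null;

  for (const sample of samples) {
    const overheadRatio = sample.candidateMs / sample.baselineMs;
    const changeVsPrevious =
      previousRatio === null ? null : (overheadRatio - previousRatio) / previousRatio;

    reports.push({
      commit: sample.commit,
      overheadRatio,
      changeVsPrevious,
      regressed: changeVsPrevious !== null && changeVsPrevious > threshold,
    });
    previousRatio = overheadRatio;
  }
  return reports;
}

// Made-up numbers: in commit "b" both benchmarks slow down together (ratio unchanged);
// in commit "c" only the candidate slows down (ratio grows).
const reports = compareDeltas([
  { commit: "a", baselineMs: 10, candidateMs: 20 },
  { commit: "b", baselineMs: 12, candidateMs: 24 },
  { commit: "c", baselineMs: 12, candidateMs: 30 },
]);
```

With these inputs, commit "b" is not flagged even though both absolute timings got worse, while commit "c" is flagged because the overhead ratio itself increased. That is the distinction we would like Best to report directly.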
Not sure I understand this use case. Let's talk.