uarch-bench issues

Disable turbo on AMD

6

The current solution in `uarch-bench.sh` to disable turbo mode (write to msr regs) only works for Intel, but we should come up with a solution for AMD. I didn't find...

travisdowns

enhancement

help wanted

store forwarding does not have fixed latency

11

You note the effect for Skylake on the wiki ("Minimum store-forwarding latency is 3 on new(ish) chips, but the load has to arrive at exactly the right time to achieve...

amonakov

Fail with diagnostic if submodules don't exist

8

Many users will probably just `git clone` followed by `make` and the submodules will be missing in that case, but the failure mode is a bit obscure: you'll get some...

travisdowns

Consider making TimerInfo::now() non-static

Currently, each `TIMER` implementation should implement a static `now()` method, which is used in the core of the benchmark loop in order to take time (or PMU or whatever other...

travisdowns

Reconsider usage of sudo in uarch-bench.sh

7

*uarch-bench.sh* contains multiple calls to `sudo`, which obviously won't work if the system doesn't have `sudo` installed (Debian minimal doesn't come installed with sudo). You could test to see if...

zingaburga

libpfc timer should include kernel cycles too

1

Currently we are only getting user-mode cycles from libpfc. We should probably track all cycles.

travisdowns

Create web version of the tool

3

I really like your tool, Travis! And it would be even more cool to have web version like https://gcc.godbolt.org/ or http://quick-bench.com/ The use case that I'm looking for is that...

dendibakh

enhancement

help wanted

Better table printing

Right now we just hardcode some column sizes, an annoying tradeoff in many cases. Let's property pretty-print our tables with dynamic column sizing.

travisdowns

Disable MSR-based turbo disabling on non-intel CPUs

If you run `./uarch-bench.sh` on AMD, it will fail when it tries to write to the turbo-disabling MSRs which are Intel-specific. We should skip this step on AMD.

travisdowns

help wanted

Implement "delta" measurement

1

Currently we just measure the absolute time of the code under test like so: static int64_t time_method(size_t loop_count) { auto t0 = CLOCK::now(); METHOD(loop_count); auto t1 = CLOCK::now(); return t1...

travisdowns

enhancement

uarch-bench
uarch-bench copied to clipboard

Metadata

Disable turbo on AMD

store forwarding does not have fixed latency

Fail with diagnostic if submodules don't exist

Consider making TimerInfo::now() non-static

Reconsider usage of sudo in uarch-bench.sh

libpfc timer should include kernel cycles too

Create web version of the tool

Better table printing

Disable MSR-based turbo disabling on non-intel CPUs

Implement "delta" measurement

← Metadata

Owner

Metadata

uarch-bench uarch-bench copied to clipboard

Metadata

← Metadata

Owner

Metadata

uarch-bench
uarch-bench copied to clipboard