sightglass issues

Emit measurement results immediately

6

@jameysharp mentioned in https://github.com/bytecodealliance/sightglass/issues/202#issuecomment-1269162316 that it would be nice if measurement results were emitted as soon as they were collected. Currently this is not the case: all the measurements for...

abrown

Provide three-state output: "changed", "not changed", "unsure"

Right now, Sightglass uses a single threshold based on a confidence interval computed by Behrens-Fisher to determine whether a sampled statistic shifted between configurations. The result of this is that...

cfallin

Allow `summarize` to aggregate multiple benchmarks into one score

2

When measuring more than one benchmark, it would be nice to be able to aggregate the results into a single score. One common way to do this is to take...

abrown

Add a V8 engine

1

This change adds the beginnings of a new V8 engine to Sightglass. It uses V8's `libwee8` library as the backing engine and constructs a `libengine.so` in C++ that is compatible...

abrown

Detect and warn if the samples are not normally distributed

1

From #138: > Ah, and one more thought: have we considered any statistical analysis that would look for multi-modal distributions (and warn, at least)? If we see that e.g. half...

fitzgen

sightglass-next: implement sightglass-server

2

In order to report performance results based on PRs, we talked about implementing an HTTP server (e.g. in `crates/server`) that would: - listen for incoming `POST` requests that contain JSON...

abrown

Warn when CPU governor is not "performance" on Linux

1

From https://github.com/bytecodealliance/sightglass/issues/138: > Observe CPU governor settings when on a known platform (Linux: `/sys/devices/system/cpu/cpu*/cpufreq/scaling_governor` text file, will usually be `ondemand`, we want `performance`) and warn if scaling is turned on...

fitzgen

Account for varying CPU frequency more robustly

17

Most modern CPUs scale their clock frequency according to demand, and this CPU frequency scaling is always a headache when running benchmarks. There are two main dimensions in which this...

cfallin

Interleave benchmark iterations, not just processes

From https://github.com/bytecodealliance/sightglass/issues/138: > Interleave benchmark runs appropriately. Right now, it looks like the top-level runner does a batch of runs with one engine, then a batch of runs with another....

fitzgen

Allow multiple instantiations per compilation

This would allow us to get more samples in that much less time (could get, say, ten instantiation and execution samples per compilation) but would also let us stress test...

fitzgen

sightglass
sightglass copied to clipboard

Metadata

Emit measurement results immediately

Provide three-state output: "changed", "not changed", "unsure"

Allow `summarize` to aggregate multiple benchmarks into one score

Add a V8 engine

Detect and warn if the samples are not normally distributed

sightglass-next: implement sightglass-server

Warn when CPU governor is not "performance" on Linux

Account for varying CPU frequency more robustly

Interleave benchmark iterations, not just processes

Allow multiple instantiations per compilation

← Metadata

Owner

Metadata

sightglass sightglass copied to clipboard

Metadata

← Metadata

Owner

Metadata

sightglass
sightglass copied to clipboard