nvbench icon indicating copy to clipboard operation
nvbench copied to clipboard

CUDA Kernel Benchmarking Library

Results 99 nvbench issues
Sort by recently updated
recently updated
newest added

To implement a Setup, which is shared among all states of a `benchmark_base` a current solution could be: ```cpp void my_benchmark(nvbench::state& state) { static int num_execs = 0; if (execs...

type: enhancement
P1: should have
helps: rapids

To implement Global Setup and TearDown feature, similar to [GoogleTest Global Set-Up and Tear-Down](https://github.com/google/googletest/blob/main/docs/advanced.md#global-set-up-and-tear-down). There are use cases in [cudf ](https://github.com/rapidsai/cudf/blob/acb6aed5bc5929d7ec97a30a3fbfebd86bf0d703/cpp/benchmarks/io/orc/orc_reader.cpp#L199-L203)that could use Global Set-Up and Tear-Down feature, as pointed...

type: enhancement
P1: should have
helps: rapids

Add a `--color` option to `scripts/nvbench_compare.py`, and only import/use colorama when this is explicitly passed in.

type: enhancement
P1: should have
good first issue
python

Adding NVML as an optional dependency would allow some cool features: * Lock clock frequency. * Per-device default frequency. * Per-device maximum frequency. * Explicit frequency. * Log various device...

type: enhancement
P1: should have

Right now it looks like benchmark names must be the full name of the benchmark, but it would be useful to provide a regex syntax as gbench allows.

good first issue

We should add an `output` directory under `nvbench/example` that contains the json, csv, and md outputs for each benchmark on a multidevice system. This should be linked to from the...

good first issue
only: docs

NVBench has a work-in-progress JSON output format and I'm working on a very basic python script to compare two JSON files. We should grow this functionality into a more complete...

type: enhancement
P0: must have

# Overview RAPIDS needs a way to iterate through correlated axes in lockstep. # Example Usecase Consider a benchmark that takes three parameters "X", "Y", and "Z", where X is...

type: enhancement
P2: nice to have
helps: rapids

There are some compilation errors with CUPTI from CTK 11.2: ``` FAILED: nvbench/CMakeFiles/nvbench.dir/cupti_profiler.cxx.o /home/av/code/utils/ccache-install/bin/g++ -DFMT_LOCALE -Dnvbench_EXPORTS -I/home/av/code/src/nvbench -I/home/av/code/build/nvbench-ctk_11.2 -I/home/av/code/build/nvbench-ctk_11.2/_deps/fmt-src/include -I/home/av/code/build/nvbench-ctk_11.2/nvbench/detail -isystem /usr/local/cuda_11.2.0_460.22/include -isystem /usr/local/cuda_11.2.0_460.22/extras/CUPTI/include -isystem /home/av/code/build/nvbench-ctk_11.2/_deps/nlohmann_json-src/include -O3 -DNDEBUG -fPIC -Wall...

P2: nice to have
type: bug: functional

Adds samples of `--list`, `--md`, `--json`, and `--csv` for each example. Fixed a race condition where calling `std::exit(0)` in the parser was skipping ofstream destructors. Some files were never flushed/closed,...

type: enhancement
P2: nice to have