Make an efficient CUDA microbenchmark framework

Open Red-Portal opened this issue 7 years ago • 1 comment

Make an efficient CUDA microbenchmark framework

The current workflow for writing and optimizing CUDA kernels is very difficult because there is no proper, consistent way to measure kernel performance. A simple, consistent tool for measuring and profiling CUDA kernels is required.

Requirements

  • Automatic measurement of FLOPS (probably using nvprof)
  • Measurement of parallel scaling
  • Simple, compact API
  • Plotting of the benchmark reports (probably using pyplot or gnuplot)
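
To make the first three requirements concrete, here is a minimal host-side sketch in C++. It is not the API of mgcpp or mgbench: the names `BenchResult`, `run_benchmark`, and `scaling_efficiency` are hypothetical, and the FLOP count is assumed to be supplied by the caller rather than read from nvprof counters. Because CUDA kernel launches are asynchronous, the callable passed in is assumed to launch the kernel and then synchronize (for example with `cudaDeviceSynchronize()`), so that wall-clock time covers the kernel's execution.

```cpp
#include <cassert>
#include <chrono>
#include <cstddef>
#include <functional>

// Hypothetical result record; the field names are illustrative only.
struct BenchResult {
    double mean_seconds;   // mean wall-clock time per timed repetition
    double flops_per_sec;  // caller-supplied FLOP count / mean_seconds
};

// Runs `fn` `warmup` times untimed, then `reps` times timed.
// `flop_count` is the number of floating-point operations one call
// performs; this sketch assumes the caller knows it in advance.
inline BenchResult run_benchmark(const std::function<void()>& fn,
                                 double flop_count,
                                 std::size_t reps = 10,
                                 std::size_t warmup = 2) {
    if (reps == 0) return {0.0, 0.0};

    // Warm-up runs absorb one-time costs such as lazy initialization.
    for (std::size_t i = 0; i < warmup; ++i) fn();

    using clock = std::chrono::steady_clock;
    const auto start = clock::now();
    for (std::size_t i = 0; i < reps; ++i) fn();
    const auto stop = clock::now();

    const double total = std::chrono::duration<double>(stop - start).count();
    const double mean = total / static_cast<double>(reps);
    return {mean, mean > 0.0 ? flop_count / mean : 0.0};
}

// Strong-scaling efficiency: measured speedup over the baseline divided
// by the factor by which resources grew. 1.0 means ideal scaling.
inline double scaling_efficiency(double base_seconds,
                                 double scaled_seconds,
                                 double resource_factor) {
    assert(scaled_seconds > 0.0 && resource_factor > 0.0);
    return (base_seconds / scaled_seconds) / resource_factor;
}
```

A call might look like `run_benchmark([&] { launch_kernel(); cudaDeviceSynchronize(); }, 2.0 * n)`, where `launch_kernel` and `n` stand in for whatever is being measured. Taking the workload as a plain callable keeps the API small, and the results could be written to a file for plotting with pyplot or gnuplot.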

Red-Portal, Dec 20 '17 14:12

Working on this in a separate repository: https://github.com/MGfoundation/mgbench

Red-Portal, Jan 14 '18 16:01