mgcpp Make an efficient CUDA microbenchmark framework

Make an efficient CUDA microbenchmark framework

Open Red-Portal opened this issue 7 years ago • 1 comments

Make a efficient CUDA micro benchmark framework

The current workflow of writing/optimizing CUDA kernels is very difficult because there is no proper, consistent way of measuring the performance of kernels. A simple and consistent tool to measure and profile CUDA kernels is required.

Requirements

Automatic measuring of FLOPS (probably using nvprof)
Measuring of parallel scaling
Simple, nutshell API
Plotting the benchmark reports (probably using pyplot, gnuplot)

Dec 20 '17 14:12 Red-Portal

working on this on a separate repository https://github.com/MGfoundation/mgbench

Jan 14 '18 16:01 Red-Portal

mgcpp mgcpp copied to clipboard

Make an efficient CUDA microbenchmark framework

Requirements

mgcpp
mgcpp copied to clipboard