Optimizing-SGEMM-on-NVIDIA-Turing-GPUs icon indicating copy to clipboard operation
Optimizing-SGEMM-on-NVIDIA-Turing-GPUs copied to clipboard

event时间统计有问题

Open alg-leon opened this issue 3 years ago • 0 comments

for (n_count = 0; n_count < N; n_count++) {
        cudaEventRecord(beg);
        test_kernel(kernel_num, m, n, k, alpha, dA, dB, beta, dC, err);
        cudaEventRecord(end);
        cudaEventSynchronize(beg);
        cudaEventSynchronize(end);
        cudaEventElapsedTime(&ms, beg, end);
        elapsed_time += ms;
      }

这种方式统计时间更准确

alg-leon avatar Jan 11 '22 11:01 alg-leon