cuda-profiler icon indicating copy to clipboard operation
cuda-profiler copied to clipboard

Tools and extensions for CUDA profiling

Results 7 cuda-profiler issues
Sort by recently updated
recently updated
newest added

Dear developers: How to reduce Nvporf output nvvp file sizes through re-sampling events? One of the options I can find from Nvprof help that I thought it might work is...

nvtx_pmpi interfaces Fortran MPI_* calls to C PMPI_* calls itself, rather than leaving that step up to the underlying MPI library. Unfortunately it gets some things wrong in the process,...

The dlprof tool analyzed the deep model and proposed that the data shape did not meet the requirements of tensor core. The original script set five full connection layers, namely,...

I'm running Visual Profiler on Windows and try to remotely profile ubuntu machine. I don't have Nvidia GPU on my Windows. When trying to create new session, I got the...

Dear Nvprof developers: I want to use nvprof to profile my cuda+mpi application. But the little test shows that the options --annote-mpi openmpi does not produce any information about MPI...

git reflog 5a6577f (HEAD -> master, origin/master, origin/HEAD) HEAD@{0}: clone: from https://github.com/NVIDIA/cuda-profiler.git pgcc --version pgcc 19.5-0 LLVM 64-bit target on x86-64 Linux -tp sandybridge PGI Compilers and Tools Copyright (c)...

This adds MPI_Init_thread to the wrapped calls.