Pramod Kumbhar

Results 229 comments of Pramod Kumbhar

@adamjhn or @ramcdougal: do you have a suggestion for this issue?

> With perf I got close to measuring false sharing but didn't get to the point where I could focus on a specific section of code With the `perf c2c`...

I believe there is a somewhere inconsistency: I did get the same error: ```bash CMake Error at /tmp/pip-req-build-pq0o2oig/3rdparty/cudnn-frontend/cmake/cuDNN.cmake:3 (find_path): Could not find CUDNN_INCLUDE_DIR using the following files: cudnn.h Call Stack...

Is this still an issue? The fix I suggested above was merged in https://github.com/NVIDIA/TransformerEngine/pull/1589.

> I don't have SVE hardware available to me > I guess we can start with emulation. Brief googling says that emulators are available. Until last year there was no...

Just to update here @georgemitenkov : I have tested few small examples and SSE vs AVX-2 examples locally. But for detailed analysis, I will wait for #611 ( / #612)...

> Great! I had an exam yesterday so Monday/Tuesday were a bit out for me. Oh ok! Np! > What do you think? @pramodk Yup, above part LGTM!

> One difference that might be unintended is that now an array of value does cause printing information for each entry. Rather it'll print something for the first value and...

Hello @lee218llnl! Somehow, I missed this notification. Sorry! > There is nothing explicit in STAT for AI/ML applications, but that being said, STAT just gathers stack traces from application processes,...

I ended up on this PR while reviewing [some nvidia framework docs](https://nvidia.github.io/nvidia-resiliency-ext/inprocess/usage_guide.html#known-issues). I am wondering what is blocking for this and if previously mentioned issues still exist and blocker here.