Danial Javady

Results 31 comments of Danial Javady

@justinchuby I'll give this one a go! I'll take a look at `split_12_13.h` as an example.

Wouldn't subsequent runs after the first one rule out downloading? With that being said the internet connection is at around 900 - 1100 mbps. Definitely not the problem. Important to...

Out of boredom and curiosity I tried to update the code on my own to accommodate for the breaking changes. For now it seems to build with cudnn 9, I...

I have a PR up for this but it seems like one of hte builds broke. I don't have time to fix this for a while but I am pretty...

@miscco Could you comment please?

@hwu36 [comparison_hgrad.csv](https://github.com/NVIDIA/cutlass/files/14878462/comparison_hgrad.csv) Not seeing benefits from this one either. ran ``` ./tools/profiler/cutlass_profiler --kernels=cutlass_tensorop_h16816dgrad_optimized_* --n=34 --h=28 --w=28 --c=512 --k=1024 --r=1 --s=1 --pad_h=0 --pad_w=0 --stride_h=2 --stride_w=2 --dilation_h=1 --dilation_w=1 ```

@manishucsd Sorry that file isn't complete, please ignore. I'll paste the complete one here(also with running align8 only) THis one will have load, store, load and store, and normal GFLOPS...

@janeyx99 @ptrblck 2nd PR! 🎉 Sorry it took me some days. I'm learning Pytorch internals independently so I'm still learning the codebase. Also I'm curious to know if there's a...

@lezcano My apologies! I've cleaned it up. I missed some new lines from when I was cleaning up my debugging/experimenting code so I could understand the codebase.