cutlass icon indicating copy to clipboard operation
cutlass copied to clipboard

CUDA Templates for Linear Algebra Subroutines

Results 608 cutlass issues
Sort by recently updated
recently updated
newest added

**What is your question?** I recently tried to change the type tags on the [DGEMM examples](https://github.com/NVIDIA/cutlass/blob/main/examples/45_dual_gemm/dual_gemm.cu) to ```cutlass::arch::Sm90```, which caused a load of compile errors. This is primarily because there's...

help wanted
question
inactive-30d
inactive-90d

I'm adding (PR [here](https://github.com/pytorch/pytorch/pull/119986)) CUTLASS kernels as an auto-tune option for PyTorch compiler, and it would be nice to have these additional configurations available. This is not urgent, and more...

inactive-30d

**What is your question?** Hello, I found that many epilogues are element-wise. I wondered if it could be customized to sum up a `2*2` tile instead of an element-wise operation....

question
inactive-30d

I have implemented a basic sample code to convolve a 2D image with a row filter. It works, but when the dst image has some stride, it seems ignored by...

bug

Is s8 * s8 = {s32, s8} supported in cuTe?

question
? - Needs Triage

I am benchmarking sparse and dense GEMMs through the cutlass profiler. I am seeing that sparse GEMMs run **slower** than dense GEMMs in the same scenario. For example, compare the...

question
inactive-30d
inactive-90d

Cutlass profiler has a great set of flags to perform shmoos across different matrix shapes and sizes. While benchmarking GEMMs using the cutlass profiler, one can use Cublas as a...

Dynamic offsets in `DefaultEpilogue` allows to move pointer arithmetics to device and shift `C` and `D` pointers based on offsets stored in device memory. Depends on https://github.com/NVIDIA/cutlass/pull/1273

inactive-30d
inactive-90d

As stands, when a runtime assert is called on CUDA platforms your program just explodes with no stack trace and no mention of the error that was encountered. I just...

inactive-30d
inactive-90d