cutlass icon indicating copy to clipboard operation
cutlass copied to clipboard

CUDA Templates for Linear Algebra Subroutines

Results 608 cutlass issues
Sort by recently updated
recently updated
newest added

`-I/usr/local/cuda/include/cccl` in cuda 13 (related #2543) remove duplicated `--cuda-gpu-arch` `_LIBCUDACXX_STD_VER` is deprecated and not used by the project

inactive-30d

Ports over some of the latex printing functionality from C++ and adds an example.

inactive-30d

### Which component has the problem? CuTe DSL ### Bug Report **Describe the bug** **Steps/Code to reproduce bug** ``` import cutlass.cute as cute @cute.jit def test(): layoutA = cute.make_layout((4, 4),...

bug
? - Needs Triage
inactive-30d
CuTe DSL

**What is your question?** I am trying to use cutlass on Ampere architecture to multiply two rectangular matrix MxK and KxN where M and N are small (say 16) and...

question
? - Needs Triage
inactive-30d

### Which component has the problem? CuTe DSL ### Bug Report Building nvidia-cutlass-dsl with dynamic versioning always produces a wheel with version 0.0.0 due to missing VERSION.EDITABLE. Suggest using setuptools-scm...

bug
? - Needs Triage
CuTe DSL

Summary ------- Implements dual-GEMM examples for SM90 (Hopper) and SM120 (Blackwell) using CUTLASS 3.x. The dual-GEMM operation implemented is: ``` D0 = epilogue0(X @ B0, C0) D1 = epilogue1(X @...

inactive-30d

I need to fix blockM and blockN to ensure batch invariance. Can the CUTLASS gemm interface control this? using GemmKernel = cutlass::gemm::kernel::GemmUniversal< cute::Shape, CollectiveMainloop, CollectiveEpilogue>; thanks in advance!

question
? - Needs Triage
inactive-30d

### Which component has the problem? CuTe DSL ### Bug Report **Describe the bug** with nvidia-cutlass and nvidia-cutlass-dsl 4.2.0.0 ``` python cutlass/examples/python/CuTeDSL/blackwell/tutorial_gemm/fp16_gemm_1.py nvidia_cutlass_dsl/python_packages/cutlass/cute/nvgpu/tcgen05/mma.py", line 153, in __post_init__ raise OpError( cutlass.cute.nvgpu.common.OpError:...

bug
? - Needs Triage
CuTe DSL

### Which component has the problem? CuTe DSL ### Bug Report **Steps/Code to reproduce bug** ``` import torch import cutlass import cutlass.cute as cute from cutlass.cute.runtime import from_dlpack @cute.jit def...

bug
? - Needs Triage
inactive-30d
CuTe DSL