flash-attention icon indicating copy to clipboard operation
flash-attention copied to clipboard

How to start learning to manipulate tensor at low-level like flash-attention?

Open Skimmable-Code-pls opened this issue 7 months ago • 1 comments
trafficstars

I'm keen to manipulate tensor at C++ and CUDA low level, and I can ask chatGPT to translate/explain C++ line-by-line. I just don't know where to start my learning journey

Skimmable-Code-pls avatar Apr 12 '25 08:04 Skimmable-Code-pls

Triton tutorials are a good place to start to learn about how tensors are laid out in memory, and how to read & write to them. After that you can look at Cutlass.

tridao avatar Apr 12 '25 16:04 tridao