cutlass topics

Cutlass_EX

17

Stars

4

Forks

Watchers

study of cutlass

yester31

cmake

cpp17

cuda

cutlass

flash_attention_inference

20

Stars

2

Forks

Watchers

Performance of the C++ interface of flash attention and flash attention v2 in large language model (LLM) inference scenarios.

Bruce-Lee-LY

cuda

cutlass

flash-attention

flash-attention-2

flux

650

Stars

42

Forks

Watchers

A fast communication-overlapping library for tensor/expert parallelism on GPUs.

bytedance

cuda

cutlass

gpu

pytorch

whl

17

Stars

4

Forks

17

Watchers

Kernel Library Wheel for SGLang

sgl-project

cuda

cutlass

flashinfer

sglang