Liger-Kernel icon indicating copy to clipboard operation
Liger-Kernel copied to clipboard

DeepSeek Native Sparse Attention (NSA) Kernel

Open qingquansong opened this issue 7 months ago • 3 comments

🚀 The feature, motivation and pitch

Native Sparse Attention: Hardware-Aligned and Natively Trainable Sparse Attention https://arxiv.org/abs/2502.11089

Potentially useful python reference https://github.com/dhcode-cpp/NSA-pytorch

Alternatives

No response

Additional context

No response

qingquansong avatar Apr 05 '25 06:04 qingquansong