flashinfer icon indicating copy to clipboard operation
flashinfer copied to clipboard

[Feature Request] TopK Sparse Attention

Open yzh119 opened this issue 3 months ago • 3 comments

  • https://huggingface.co/openbmb/MiniCPM4.1-8B

yzh119 avatar Sep 16 '25 17:09 yzh119

From @simon-mo, the ask here is for both Hopper and Blackwell support.

Example kernel code: https://github.com/OpenBMB/infllmv2_cuda_impl

sricketts avatar Sep 16 '25 22:09 sricketts

hello, i'd like to try implementing it if possible

Liu-congo avatar Oct 03 '25 09:10 Liu-congo

hello, i'd like to try implementing it if possible

Sounds great! I'm not aware of anyone else working on this.

cc @yzh119

sricketts avatar Oct 03 '25 18:10 sricketts