vllm icon indicating copy to clipboard operation
vllm copied to clipboard

[V1][WIP] 2nd try of Hybrid allocator for full attention & sliding window attention interleaved models

Open heheda12345 opened this issue 2 weeks ago • 3 comments

Trying another implementation of #12655

heheda12345 avatar Feb 14 '25 16:02 heheda12345