Igor Poletaev
### Description

Using a start position index in a fused attention kernel does not work.

### Steps to reproduce

```
START_IDX = 128

def attention_reference(q: torch.Tensor, k: torch.Tensor, v:...
```