Andrii Staikov

Results: 12 issues by Andrii Staikov

## Description

Move the SDPAToPA-related functionality into the transformation, so that code belonging to a single component is not split across several places. This would help to make the...

category: continuous batching
category: speculative decoding
category: GHA

### Details:

The flux.1-schnell model changed after diffusers was updated from 0.33.1 to 0.35.2, which resulted in a different dimension arrangement for RoPE in the model: [batch, num_heads, -1, head_size] -> [batch,...

category: transformations