Andrii Staikov
Results
12
issues of
Andrii Staikov
## Description Move the SDPAToPA-related functionality to the transformation to avoid decoupling the code that needs to be a part of a single component. This would help to make the...
category: continuous batching
category: speculative decoding
category: GHA
### Details: The flux.1-schnell model has changes after updating diffusers from 0.33.1 to 0.35.2 which caused another dimensions arrangement for RoPE in the model: [batch, num_heads, -1, head_size] -> [batch,...
category: transformations