zzlol63


On Windows, PyTorch does not ship with FlashAttention support pre-compiled into the `torch.nn.functional.scaled_dot_product_attention` method, whereas on Linux it does, leaving a performance gap between the two platforms. This...
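For context, a minimal pure-Python sketch of what `scaled_dot_product_attention` computes, i.e. `softmax(QK^T / sqrt(d)) V`. This is the same math that FlashAttention evaluates in a fused, memory-efficient kernel; the reference version below materializes the full score matrix, which is exactly the cost the optimized backends avoid. The function name and list-based layout are illustrative, not PyTorch's implementation.

```python
import math

def scaled_dot_product_attention(q, k, v):
    """Reference (unoptimized) scaled dot-product attention on plain
    lists of rows: softmax(q @ k^T / sqrt(d)) @ v."""
    d = len(q[0])
    scale = 1.0 / math.sqrt(d)
    # scores[i][j] = dot(q[i], k[j]) / sqrt(d) -- the full matrix that
    # FlashAttention avoids materializing in memory.
    scores = [[sum(qi * kj for qi, kj in zip(qrow, krow)) * scale
               for krow in k] for qrow in q]
    out = []
    for row in scores:
        # Row-wise softmax, numerically stabilized by subtracting the max.
        m = max(row)
        exps = [math.exp(s - m) for s in row]
        z = sum(exps)
        weights = [e / z for e in exps]
        # Each output row is a convex combination of the value rows.
        out.append([sum(w * vrow[c] for w, vrow in zip(weights, v))
                    for c in range(len(v[0]))])
    return out

# Tiny example: 2 queries, 2 keys/values, head dimension 2.
q = [[1.0, 0.0], [0.0, 1.0]]
k = [[1.0, 0.0], [0.0, 1.0]]
v = [[1.0, 2.0], [3.0, 4.0]]
print(scaled_dot_product_attention(q, k, v))
```

Optimized backends (FlashAttention, memory-efficient attention, or the math fallback) all produce this same result; they differ only in speed and memory use, which is why the missing Windows kernels matter.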

### Describe your use-case

The latest version of Diffusers supports selecting a specific attention backend, such as FlashAttention-2 or FlashAttention-3 (which supports the backward pass). OneTrainer could potentially...

enhancement