DiT
DiT copied to clipboard
about fused_attention
Thank you for work
I was wondering what's the difference between fuse attention you use in timm and flash attention?