diffusion icon indicating copy to clipboard operation
diffusion copied to clipboard

Implementing Mosaic Diffusion into Patch-Diffusion

Open nam-drun opened this issue 1 year ago • 0 comments

Patch Diffusion can x2 training speed even on 256x256 ImageNet. If this works out between Mosaic Diffusion and Patch-Diffusion, that is potentially x10 cumulative boost. The issue is both have different training script so I'm thinking of copy+paste the features of Mosaic into Patch-Diffusion. Right now, I only ask where to find relevant code for

  1. xFormer+FlashAttention - I'll be trying to swap FlashAttention-1 for FlashAttention-2
  2. Precomputing latent
  3. Low Precision LayerNorm and GroupNorm
  4. FSDP
  5. Scheduled EMA

nam-drun avatar Feb 18 '24 21:02 nam-drun