rectified-flow-pytorch

Image and Video Generation with Flows

Open • EngEmmanuel opened this issue 4 months ago • 3 comments

I am interested in doing image and video generation using mean flow and other flow variants. Ideally, I would be able to use both UNet and Transformer architectures, but UNet is my priority. This would involve combining different pieces of code you have graciously contributed, e.g. this repository, 'imagen-pytorch', and 'x-transformers'.

What do you think is the best way to go about this? For example, I could try to tweak the Unet3D class from 'imagen-pytorch' to work with the different flow classes you have written in this repo. Alternatively, I could port the flow code over to the 'imagen-pytorch' setup. I would also like to be able to use classifier-free guidance.

Where is the best place to start? I can try to contribute if you can provide some pointers.

EngEmmanuel • Aug 07 '25 09:08