Add Latte: Latent Diffusion Transformer for Video Generation

Open kabachuha opened this issue 1 year ago • 1 comments

Model/Pipeline/Scheduler description

Latte is a text2video diffusion transformer (similar to Sora) that improves on the DiT and PixArt-alpha text2image models.

The implementation is already based on diffusers (see latte_t2v.py), so adding it here should be straightforward.

Open source status

  • [X] The model implementation is available.
  • [X] The model weights are available (Only relevant if addition is not a scheduler).

Provide useful links for the implementation

  • Official repo: https://github.com/Vchitect/Latte
  • Model on Huggingface: https://huggingface.co/maxin-cn/Latte
  • Paper: https://arxiv.org/abs/2401.03048v1
  • Project page: https://maxin-cn.github.io/latte_project/

kabachuha avatar Mar 05 '24 16:03 kabachuha

Thanks for bringing this to our notice. But as far as I understand it from here, the current model produces watermarked videos. Maybe let's wait until they release the unwatermarked version? Cc: @DN6

sayakpaul avatar Mar 06 '24 04:03 sayakpaul

This issue has been automatically marked as stale because it has not had recent activity. If you think this still needs to be addressed please comment on this thread.

Please note that issues that do not follow the contributing guidelines are likely to be ignored.

github-actions[bot] avatar Apr 05 '24 15:04 github-actions[bot]

@sayakpaul Hi, I am the first author of Latte, and we have released an unwatermarked version of the LatteT2V model. We want to integrate Latte into the diffusers library. What should I do? The pre-trained LatteT2V models are here and the code is here.

maxin-cn avatar Jun 03 '24 07:06 maxin-cn

Ccing @DN6 into this thread for further comments. I am happy to have the model integrated :)

sayakpaul avatar Jun 03 '24 07:06 sayakpaul

Thanks for integrating Latte, and for your awesome work, maxin!

a-r-r-o-w avatar Jul 19 '24 09:07 a-r-r-o-w