multimodal-generation topic

Repositories tagged multimodal-generation

Text2Poster-ICASSP-22

Stars: 203, Forks: 16

Official implementation of the ICASSP-2022 paper "Text2Poster: Laying Out Stylized Texts on Retrieved Images"

UniteandConquer

Stars: 34, Forks: 3

[CVPR '23] Unite and Conquer: Plug & Play Multi-Modal Synthesis using Diffusion Models

ContextDiff

Stars: 56, Forks: 3

[ICLR 2024] Contextualized Diffusion Models for Text-Guided Image and Video Generation

MiniGPT-5

Stars: 845, Forks: 52

Official implementation of paper "MiniGPT-5: Interleaved Vision-and-Language Generation via Generative Vokens"

Awesome-LLMs-meet-Multimodal-Generation

Stars: 322, Forks: 17

🔥🔥🔥 A curated list of papers on LLMs-based multimodal generation (image, video, 3D and audio).