multimodal-generation topic
                        List
                        multimodal-generation repositories
                    
                Text2Poster-ICASSP-22
                            
                                203
                            
                            
                        
                        Stars
                    
                            
                                16
                            
                            
                        
                        Forks
                    Watchers
                    Official implementation of the ICASSP-2022 paper "Text2Poster: Laying Out Stylized Texts on Retrieved Images"
UniteandConquer
                            
                                34
                            
                            
                        
                        Stars
                    
                            
                                3
                            
                            
                        
                        Forks
                    Watchers
                    [CVPR '23] Unite and Conquer: Plug & Play Multi-Modal Synthesis using Diffusion Models
ContextDiff
                            
                                56
                            
                            
                        
                        Stars
                    
                            
                                3
                            
                            
                        
                        Forks
                    Watchers
                    [ICLR 2024] Contextualized Diffusion Models for Text-Guided Image and Video Generation
MiniGPT-5
                            
                                845
                            
                            
                        
                        Stars
                    
                            
                                52
                            
                            
                        
                        Forks
                    Watchers
                    Official implementation of paper "MiniGPT-5: Interleaved Vision-and-Language Generation via Generative Vokens"
Awesome-LLMs-meet-Multimodal-Generation
                            
                                322
                            
                            
                        
                        Stars
                    
                            
                                17
                            
                            
                        
                        Forks
                    Watchers
                    🔥🔥🔥 A curated list of papers on LLMs-based multimodal generation (image, video, 3D and audio).