Ling Yang
Diffusion-Models-Papers-Survey-Taxonomy
Diffusion model papers, survey, and taxonomy
RPG-DiffusionMaster
[ICML 2024] Mastering Text-to-Image Diffusion: Recaptioning, Planning, and Generating with Multimodal LLMs (RPG)
SGDiff
Official implementation for "Diffusion-Based Scene Graph to Image Generation with Masked Contrastive Pre-Training" (https://arxiv.org/abs/2211.11138)
VQGraph
[ICLR 2024] VQGraph: Rethinking Graph Representation Space for Bridging GNNs and MLPs
ContextDiff
[ICLR 2024] Contextualized Diffusion Models for Text-Guided Image and Video Generation
RealCompo
[NeurIPS 2024] RealCompo: Balancing Realism and Compositionality Improves Text-to-Image Diffusion Models
EditWorld
EditWorld: Simulating World Dynamics for Instruction-Following Image Editing
buffer-of-thought-llm
[NeurIPS 2024 Spotlight] Buffer of Thoughts: Thought-Augmented Reasoning with Large Language Models
consistency_flow_matching
Official implementation for "Consistency Flow Matching: Defining Straight Flows with Velocity Consistency"
VideoTetris
[NeurIPS 2024] VideoTetris: Towards Compositional Text-To-Video Generation