Guided-I2I-Translation-Papers
Guided Image-to-Image Translation Papers
Feel free to send a PR or open an issue. This list is constantly updated.
- Class Label Guided
- Action Unit Guided
- Facial Landmark Guided
- Pose Guided Person Image Generation
- Segmentation Map Guided Scene Image Generation
- Texture Patch Guided
- Example Guided
- Attention Guided
- Mask Guided
- Text Guided
- Audio Guided
Class Label Guided
| Model | Paper | Conference | Arxiv | Code |
|---|---|---|---|---|
| IcGAN | Invertible Conditional GANs for Image Editing | NeurIPSW 2016 | 1611.06355 | Guim3/IcGAN |
| Conditional CycleGAN | Conditional CycleGAN for Attribute Guided Face Image Generation | ECCV 2018 | 1705.09966 | |
| StarGAN | StarGAN: Unified Generative Adversarial Networks for Multi-Domain Image-to-Image Translation | CVPR 2018 | 1711.09020 | yunjey/StarGAN |
| AGUIT | Attribute Guided Unpaired Image-to-Image Translation with Semi-supervised Learning | | 1904.12428 | imlixinyang/AGUIT |
| AttGAN | AttGAN: Facial Attribute Editing by Only Changing What You Want | TIP 2019 | 1711.10678 | LynnHo/AttGAN-Tensorflow |
| SGGAN | Sparsely Grouped Multi-task Generative Adversarial Networks for Facial Attribute Manipulation | MM 2018 | 1805.07509 | zhangqianhui/Sparsely-Grouped-GAN |
| RelGAN | RelGAN: Multi-Domain Image-to-Image Translation via Relative Attributes | ICCV 2019 | 1908.07269 | elvisyjlin/RelGAN-PyTorch, willylulu/RelGAN |
Action Unit Guided
| Model | Paper | Conference | Arxiv | Code |
|---|---|---|---|---|
| GANimation | GANimation: Anatomically-aware Facial Animation from a Single Image | ECCV 2018 | 1807.09251 | albertpumarola/GANimation |
Facial Landmark Guided
| Model | Paper | Conference | Arxiv | Code |
|---|---|---|---|---|
| G2GAN | Geometry Guided Adversarial Facial Expression Synthesis | MM 2018 | 1712.03474 | |
| CMM-Net | Every Smile is Unique: Landmark-Guided Diverse Smile Generation | CVPR 2018 | 1802.01873 | |
| C2GAN | Cycle In Cycle Generative Adversarial Networks for Keypoint-Guided Image Generation | MM 2019 | 1908.00999 | Ha0Tang/C2GAN |
| | Few-Shot Adversarial Learning of Realistic Neural Talking Head Models | ICCV 2019 | 1905.08233 | grey-eye/talking-heads |
Pose Guided Person Image Generation
| Model | Paper | Conference | Arxiv | Code |
|---|---|---|---|---|
| PG2 | Pose Guided Person Image Generation | NeurIPS 2017 | 1705.09368 | charliememory/Pose-Guided-Person-Image-Generation |
| PoseGAN | Deformable GANs for Pose-Based Human Image Generation | CVPR 2018 | 1801.00055 | AliaksandrSiarohin/pose-gan |
| VUnet | A Variational U-Net for Conditional Appearance and Shape Generation | CVPR 2018 | 1804.04694 | CompVis/vunet |
| PoseWarp | Synthesizing Images of Humans in Unseen Poses | CVPR 2018 | 1804.07739 | posewarp-cvpr2018 |
| DPIG | Disentangled Person Image Generation | CVPR 2018 | 1712.02621 | charliememory/Disentangled-Person-Image-Generation |
| FD-GAN | FD-GAN: Pose-guided Feature Distilling GAN for Robust Person Re-identification | NeurIPS 2018 | 1810.02936 | yxgeee/FD-GAN |
| PN-GAN | Pose-Normalized Image Generation for Person Re-identification | ECCV 2018 | 1712.02225 | naiq/PN_GAN |
| GestureGAN | GestureGAN for Hand Gesture-to-Gesture Translation in the Wild | MM 2018 | 1808.04859 | Ha0Tang/GestureGAN |
| PATN | Progressive Pose Attention for Person Image Generation | CVPR 2019 | 1904.03349 | tengteng95/Pose-Transfer |
| SPT | Unsupervised Person Image Generation with Semantic Parsing Transformation | CVPR 2019 | 1904.03379 | SijieSong/person_generation_spt |
| | Coordinate-based Texture Inpainting for Pose-Guided Human Image Generation | CVPR 2019 | 1811.11459 | project |
| IntrinsicFlow | Dense Intrinsic Appearance Flow for Human Pose Transfer | CVPR 2019 | 1903.11326 | ly015/intrinsic_flow |
| TriangleGAN | Gesture-to-Gesture Translation in the Wild via Category-Independent Conditional Maps | MM 2019 | 1907.05916 | yhlleo/TriangleGAN |
| Pix2pixHD + Temporal Smoothing + FaceGAN | Everybody Dance Now | ICCV 2019 | 1808.07371 | project |
| LiquidWarpingGAN | Liquid Warping GAN: A Unified Framework for Human Motion Imitation, Appearance Transfer and Novel View Synthesis | ICCV 2019 | 1909.12224 | svip-lab/impersonator |
| Global-Flow-Local-Attention | Deep Image Spatial Transformation for Person Image Generation | CVPR 2020 | 2003.00696 | RenYurui/Global-Flow-Local-Attention |
| ADGAN | Controllable Person Image Synthesis With Attribute-Decomposed GAN | CVPR 2020 | 2003.12267 | menyifang/ADGAN |
| CoCosNet | Cross-domain Correspondence Learning for Exemplar-based Image Translation | CVPR 2020 | 2004.05571 | microsoft/CoCosNet |
| SMIS | Semantically Multi-modal Image Synthesis | CVPR 2020 | 2003.12697 | Seanseattle/SMIS |
| MISC | MISC: Multi-Condition Injection and Spatially-Adaptive Compositing for Conditional Person Image Synthesis | CVPR 2020 | cvpr20 | |
| Warp3d_Reposing | Reposing Humans by Warping 3D Features | CVPR 2020 Workshop | 2006.04898 | MKnoche/warp3d_reposing |
| | Wish You Were Here: Context-Aware Human Generation | CVPR 2020 | 2005.10663 | |
| PoseStylizer | Generating Person Images with Appearance-aware Pose Stylizer | IJCAI 2020 | 2007.09077 | siyuhuang/PoseStylizer |
| XingGAN | XingGAN for Person Image Generation | ECCV 2020 | 2007.09278 | Ha0Tang/XingGAN |
Segmentation Map Guided Scene Image Generation
| Model | Paper | Conference | Arxiv | Code |
|---|---|---|---|---|
| CRN | Photographic Image Synthesis with Cascaded Refinement Networks | ICCV 2017 | 1707.09405 | CQFIO/PhotographicImageSynthesis |
| CrossNet | Predicting Ground-Level Scene Layout from Aerial Imagery | CVPR 2017 | 1612.02709 | viibridges/crossnet |
| SIMS | Semi-parametric Image Synthesis | CVPR 2018 | 1804.10992 | xjqicuhk/SIMS |
| Pix2PixHD | High-Resolution Image Synthesis and Semantic Manipulation with Conditional GANs | CVPR 2018 | 1711.11585 | NVIDIA/pix2pixHD |
| X-Fork & X-Seq | Cross-View Image Synthesis using Conditional GANs | CVPR 2018 | 1803.03396 | kregmi/cross-view-image-synthesis |
| Vid2Vid | Video-to-Video Synthesis | NeurIPS 2018 | 1808.06601 | NVIDIA/vid2vid |
| SPADE | Semantic Image Synthesis with Spatially-Adaptive Normalization | CVPR 2019 | 1903.07291 | NVlabs/SPADE |
| SelectionGAN | Multi-Channel Attention Selection GAN with Cascaded Semantic Guidance for Cross-View Image Translation | CVPR 2019 | 1904.06807 | Ha0Tang/SelectionGAN |
| Art2Real | Art2Real: Unfolding the Reality of Artworks via Semantically-Aware Image-to-Image Translation | CVPR 2019 | 1811.10666 | aimagelab/art2real |
| | Mask-Guided Portrait Editing with Conditional GANs | CVPR 2019 | 1905.10346 | cientgu/Mask_Guided_Portrait_Editing |
| Seg2Vid | Video Generation from Single Semantic Label Map | CVPR 2019 | 1903.04480 | junting/seg2vid |
| | Semantic Bottleneck Scene Generation | | 1911.11357 | |
| Few-shot Vid2Vid | Few-shot Video-to-Video Synthesis | NeurIPS 2019 | 1910.12713 | NVlabs/few-shot-vid2vid |
| CC-FPSE | Learning to Predict Layout-to-image Conditional Convolutions for Semantic Image Synthesis | NeurIPS 2019 | 1910.06809 | xh-liu/CC-FPSE |
| SEAN | SEAN: Image Synthesis with Semantic Region-Adaptive Normalization | CVPR 2020 | 1911.12861 | ZPdesu/SEAN |
| BachGAN | BachGAN: High-Resolution Image Synthesis from Salient Object Layout | CVPR 2020 | 2003.11690 | Cold-Winter/BachGAN |
| | Panoptic-based Image Synthesis | CVPR 2020 | 2004.10289 | |
| SMIS | Semantically Multi-modal Image Synthesis | CVPR 2020 | 2003.12697 | Seanseattle/SMIS |
| GAN Compression | GAN Compression: Efficient Architectures for Interactive Conditional GANs | CVPR 2020 | 2003.08936 | mit-han-lab/gan-compression |
| LGGAN | Local Class-Specific and Global Image-Level Generative Adversarial Networks for Semantic-Guided Scene Generation | CVPR 2020 | 1912.12215 | Ha0Tang/LGGAN |
| TSIT | TSIT: A Simple and Versatile Framework for Image-to-Image Translation | ECCV 2020 | 2007.12072 | EndlessSora/TSIT |
| SegVAE | Controllable Image Synthesis via SegVAE | ECCV 2020 | 2007.08397 | yccyenchicheng/SegVAE |
| SESAME | SESAME: Semantic Editing of Scenes by Adding, Manipulating or Erasing Objects | ECCV 2020 | 2004.04977 | |
| Style Semantics | Controlling Style and Semantics in Weakly-Supervised Image Generation | ECCV 2020 | 1912.03161 | dariopavllo/style-semantics |
Texture Patch Guided
| Model | Paper | Conference | Arxiv | Code |
|---|---|---|---|---|
| TextureGAN | TextureGAN: Controlling Deep Image Synthesis with Texture Patches | CVPR 2018 | 1706.02823 | janesjanes/Pytorch-TextureGAN |
| Guided-pix2pix | Guided Image-to-Image Translation with Bi-Directional Feature Transformation | ICCV 2019 | 1910.11328 | vt-vl-lab/Guided-pix2pix |
Example Guided
| Model | Paper | Conference | Arxiv | Code |
|---|---|---|---|---|
| EG-UNIT | Exemplar Guided Unsupervised Image-to-Image Translation | ICLR 2019 | 1805.11145 | charliememory/EGSC-IT |
| Pix2pixSC | Example-Guided Style-Consistent Image Synthesis from Semantic Labeling | CVPR 2019 | 1906.01314 | cxjyxxme/pix2pixSC |
Attention Guided
| Model | Paper | Conference | Arxiv | Code |
|---|---|---|---|---|
| DA-GAN | DA-GAN: Instance-level Image Translation by Deep Attention Generative Adversarial Networks | CVPR 2018 | 1802.06454 | |
| Attention-GAN | Attention-GAN for Object Transfiguration in Wild Images | ECCV 2018 | 1803.06798 | |
| UAIT | Unsupervised Attention-guided Image to Image Translation | NeurIPS 2018 | 1806.02311 | AlamiMejjati/Unsupervised-Attention-guided-Image-to-Image-Translation |
| | Show, Attend and Translate: Unsupervised Image Translation with Self-Regularization and Attention | TIP 2019 | 1806.06195 | |
| AttentionGAN | Attention-Guided Generative Adversarial Networks for Unsupervised Image-to-Image Translation | IJCNN 2019 | 1903.12296 | Ha0Tang/AttentionGAN |
| U-GAT-IT | U-GAT-IT: Unsupervised Generative Attentional Networks with Adaptive Layer-Instance Normalization for Image-to-Image Translation | ICLR 2020 | 1907.10830 | taki0112/UGATIT, znxlwm/UGATIT-pytorch |
Mask Guided
| Model | Paper | Conference | Arxiv | Code |
|---|---|---|---|---|
| ContrastGAN | Generative Semantic Manipulation with Mask-Contrasting GAN | ECCV 2018 | 1708.00315 | |
| InstaGAN | Instance-aware Image-to-Image Translation | ICLR 2019 | 1812.10889 | sangwoomo/instagan |
| INIT | Towards Instance-level Image-to-Image Translation | CVPR 2019 | 1905.01744 | project |
Text Guided
| Model | Paper | Conference | Arxiv | Code |
|---|---|---|---|---|
| ControlGAN | Controllable Text-to-Image Generation | NeurIPS 2019 | 1909.07083 | mrlibw/ControlGAN |
| DMIT | Multi-mapping Image-to-Image Translation via Learning Disentanglement | NeurIPS 2019 | 1909.07877 | Xiaoming-Yu/DMIT |
| ManiGAN | ManiGAN: Text-Guided Image Manipulation | | 1912.06203 | |
| RefinedGAN | Image-to-Image Translation with Text Guidance | | 2002.05235 | |
Audio Guided
| Model | Paper | Conference | Arxiv | Code |
|---|---|---|---|---|
| X2Face | X2Face: A Network for Controlling Face Generation using Images, Audio, and Pose Codes | ECCV 2018 | 1807.10550 | oawiles/X2Face |