Deep_Generative_Models
A collection of papers I am interested in.
GAN-Inversion
Awesome
- https://ait.ethz.ch/index.php
- https://liuyebin.com/student.html
- https://virtualhumans.mpi-inf.mpg.de/
- https://ps.is.mpg.de/publications
- https://www.mpi-inf.mpg.de/departments/visual-computing-and-artificial-intelligence/publications
- https://ait.ethz.ch/people/hilliges/
- https://vlg.inf.ethz.ch/publications.html
Renderer
- https://github.com/eth-ait/aitviewer
- https://github.com/mitsuba-renderer/mitsuba3
- https://github.com/angeloskath/simple-3dviz
- https://github.com/BachiLi/redner
Pybind
- https://github.com/pybind/cmake_example
Video
- https://github.com/mli/autocut
Project
- mmgeneration
- inr-gan
- ADA
- awesome-image-translation
- awesome-gan-inversion
- naver-webtoon-faces
- GAN Experiments
- timm
- fun-with-computer-graphics
Face
3D
Tools
- bokeh
- face-parsing.PyTorch
- label-studio
- streamlit-drawable-canvas
- face-alignment
- remove images background
GUI
- https://github.com/gradio-app/gradio
StyleGAN
- https://github.com/justinpinkney/awesome-pretrained-stylegan2
- https://github.com/justinpinkney/awesome-pretrained-stylegan3
- generative-evaluation-prdc
Style transfer
Art
- https://github.com/fogleman/primitive
Anime
- https://github.com/TachibanaYoshino/AnimeGAN
- https://github.com/TachibanaYoshino/AnimeGANv2
TOC
- To be read
- Disentanglement
- Inversion
- Encoder
- Survey
- GANs
- Style transfer
- Metric
- Spectrum
- Weakly Supervised Object Localization
- NeRF
- 3D
arXiv
Disentanglement
| Title | Venue | Code | Year | 
|---|---|---|---|
| GANSpace: Discovering Interpretable GAN Controls | arXiv:2004.02546 [cs] | GANSpace | 2020 | 
| Interpreting the Latent Space of GANs for Semantic Face Editing | CVPR | InterFaceGAN | 2020 | 
| Closed-Form Factorization of Latent Semantics in GANs | arXiv:2007.06600 [cs] | sefa | 2020 | 
| StyleSpace Analysis: Disentangled Controls for StyleGAN Image Generation | arXiv:2011.12799 [cs] | StyleSpace | 2020 | 
| Unsupervised Image Transformation Learning via Generative Adversarial Networks | arXiv:2103.07751 [cs] | github | 2021 | 
| Resolution Dependent GAN Interpolation for Controllable Image Synthesis Between Domains | arXiv:2010.05334 [cs] | toonify | 2020 | 
| WarpedGANSpace: Finding Non-Linear RBF Paths in GAN Latent Space | arXiv:2109.13357 [cs] | | 2021 |
| Discovering Interpretable Latent Space Directions of GANs beyond Binary Attributes | CVPR | | 2021 |
Semantic hierarchy
| Title | Venue | Code | Year | 
|---|---|---|---|
| Semantic Hierarchy Emerges in Deep Generative Representations for Scene Synthesis | arXiv:1911.09267 [cs] | | 2020 |
Inversion
Optimization
Encoder
Hybrid optimization
| Title | Venue | Code | Year | 
|---|---|---|---|
| Generative Visual Manipulation on the Natural Image Manifold | ECCV | | 2018 |
| Semantic Photo Manipulation with a Generative Image Prior | ACM Transactions on Graphics | | 2019 |
| Seeing What a GAN Cannot Generate | arXiv:1910.11626 [cs, eess] | | 2019 |
| In-Domain GAN Inversion for Real Image Editing | ECCV | | 2020 |
Without optimization
| Title | Venue | Code | Year | 
|---|---|---|---|
| Closed-Form Factorization of Latent Semantics in GANs | arXiv:2007.06600 [cs] | | 2020 |
| GAN “Steerability” without Optimization | arXiv:2012.05328 [cs] | | 2021 |
| Low-Rank Subspaces in GANs | arXiv:2106.04488 [cs] | | 2021 |
| LARGE: Latent-Based Regression through GAN Semantics | arXiv:2107.11186 [cs] | | 2021 |
| Orthogonal Jacobian Regularization for Unsupervised Disentanglement in Image Generation | ICCV | | 2021 |
| Controllable and Compositional Generation with Latent-Space Energy-Based Models | NeurIPS | LACE | 2021 | 
| Do Generative Models Know Disentanglement? Contrastive Learning Is All You Need | arXiv:2102.10543 [cs] | DisCo | 2021 | 
DGP
| Title | Venue | Code | Year | 
|---|---|---|---|
| :heavy_check_mark: Exploiting Deep Generative Prior for Versatile Image Restoration and Manipulation | ECCV | DGP | 2020 | 
| :heavy_check_mark: PULSE: Self-Supervised Photo Upsampling via Latent Space Exploration of Generative Models | CVPR | PULSE | 2020 | 
| :heavy_check_mark: GLEAN: Generative Latent Bank for Large-Factor Image Super-Resolution | arXiv:2012.00739 [cs] | | 2020 |
| Unsupervised Portrait Shadow Removal via Generative Priors | arXiv:2108.03466 [cs] | | 2021 |
| Towards Real-World Blind Face Restoration with Generative Facial Prior | CVPR | GFPGAN | 2021 | 
| Towards Vivid and Diverse Image Colorization with Generative Color Prior | ICCV | | 2021 |
| Self-Validation: Early Stopping for Single-Instance Deep Generative Priors | arXiv:2110.12271 [cs.CV] | | 2021 |
| One-Shot Generative Domain Adaptation | arXiv:2111.09876 [cs] | | 2021 |
| :heart: Time-Travel Rephotography | ACM Transactions on Graphics | code | 2021 | 
Cls
| Title | Venue | Code | Year | 
|---|---|---|---|
| Contrastive Model Inversion for Data-Free Knowledge Distillation | arXiv:2105.08584 [cs] | | 2021 |
| Generative Models as a Data Source for Multiview Representation Learning | arXiv:2106.05258 [cs] | | 2021 |
| Inverting and Understanding Object Detectors | arXiv:2106.13933 [cs] | | 2021 |
| Deep Neural Networks Are Surprisingly Reversible: A Baseline for Zero-Shot Inversion | arXiv:2107.06304 [cs] | | 2021 |
| Ensembling with Deep Generative Views | arXiv:2104.14551 [cs] | | 2021 |
Change pose implicitly
| Title | Venue | Code | Year | 
|---|---|---|---|
| On the “Steerability” of Generative Adversarial Networks | arXiv:1907.07171 [cs] | | 2020 |
| Interpreting the Latent Space of GANs for Semantic Face Editing | CVPR | | 2020 |
| GANSpace: Discovering Interpretable GAN Controls | arXiv:2004.02546 [cs] | GANSpace | 2020 | 
| Closed-Form Factorization of Latent Semantics in GANs | arXiv:2007.06600 [cs] | sefa | 2020 | 
| StyleGAN of All Trades: Image Manipulation with Only Pretrained StyleGAN | arXiv:2111.01619 [cs] | | 2021 |
| Using Latent Space Regression to Analyze and Leverage Compositionality in GANs | ICLR | | 2021 |
Survey
| Title | Venue | Code | Year | 
|---|---|---|---|
| GAN Inversion: A Survey | arXiv:2101.05278 [cs] | | 2021 |
GANs
NeurIPS 2021
| Title | Venue | Code | Year | 
|---|---|---|---|
| Rebooting ACGAN: Auxiliary Classifier GANs with Stable Training | NeurIPS | | 2021 |
Theory
| Title | Venue | Code | Year | 
|---|---|---|---|
| :white_check_mark: Towards a Better Global Loss Landscape of GANs | NeurIPS | | 2020 |
| On the Benefit of Width for Neural Networks: Disappearance of Bad Basins | arXiv:1812.11039 [cs, math, stat] | | 2021 |
Regs
| Title | Venue | Code | Year | 
|---|---|---|---|
| The Hessian Penalty: A Weak Prior for Unsupervised Disentanglement | ECCV | | 2020 |
Detection
| Title | Venue | Code | Year | 
|---|---|---|---|
| Self-Supervised Object Detection via Generative Image Synthesis | arXiv:2110.09848 [cs] | | 2021 |
StyleGANs
Transformer
| Title | Venue | Code | Year | 
|---|---|---|---|
| Compositional Transformers for Scene Generation | NeurIPS | | 2021 |
| :heart: GAN-Supervised Dense Visual Alignment | arXiv:2112.05143 [cs] | gangealing | 2021 | 
| Improved Transformer for High-Resolution GANs | arXiv:2106.07631 [cs] | | 2021 |
| MaskGIT: Masked Generative Image Transformer | arXiv:2202.04200 [cs] | | 2022 |
| StyleSwin: Transformer-Based GAN for High-Resolution Image Generation | CVPR | | 2022 |
SinGAN
| Title | Venue | Code | Year | 
|---|---|---|---|
| ExSinGAN: Learning an Explainable Generative Model from a Single Image | arXiv:2105.07350 [cs] | | 2021 |
Video
| Title | Venue | Code | Year | 
|---|---|---|---|
| :heart: Diverse Generation from a Single Video Made Possible | arXiv:2109.08591 [cs] | | 2021 |
GANs
cGANs
| Title | Venue | Code | Year | 
|---|---|---|---|
| Unbiased Auxiliary Classifier GANs with MINE | arXiv:2006.07567 [cs] | | 2020 |
| Twin Auxiliary Classifiers GAN | arXiv:1907.02690 [cs, stat] | | 2019 |
Finetune
| Title | Venue | Code | Year | 
|---|---|---|---|
| FreezeG | | github | |
| :white_check_mark: Freeze the Discriminator: A Simple Baseline for Fine-Tuning GANs | arXiv:2002.10964 [cs, stat] | FreezeD | 2020 | 
| Fine-Tuning StyleGAN2 For Cartoon Face Generation | arXiv:2106.12445 [cs, eess] | Cartoon-StyleGAN | 2021 | 
| Transferring GANs: Generating Images from Limited Data | ECCV | | 2018 |
| Image Generation From Small Datasets via Batch Statistics Adaptation | ICCV | | 2019 |
| MineGAN: Effective Knowledge Transfer From GANs to Target Domains With Few Images | CVPR | | 2020 |
Compression
| Title | Venue | Code | Year | 
|---|---|---|---|
| GAN Compression: Efficient Architectures for Interactive Conditional GANs | CVPR | | 2020 |
| Online Multi-Granularity Distillation for GAN Compression | ICCV | | 2021 |
| Revisiting Discriminator in GAN Compression: A Generator-Discriminator Cooperative Compression Scheme | arXiv:2110.14439 [cs] | GCC | 2021 | 
Detection fake
| Title | Venue | Code | Year | 
|---|---|---|---|
| Robust Attentive Deep Neural Network for Exposing GAN-Generated Faces | arXiv:2109.02167 [cs] | | 2021 |
Segmentation
| Title | Venue | Code | Year | 
|---|---|---|---|
| Labels4Free: Unsupervised Segmentation Using StyleGAN | arXiv:2103.14968 [cs] | | 2021 |
| BigDatasetGAN: Synthesizing ImageNet with Pixel-Wise Annotations | arXiv:2201.04684 [cs] | | 2022 |
Datasets
alias (ref)
| Title | Venue | Code | Year | 
|---|---|---|---|
| Alias-Free Generative Adversarial Networks | arXiv:2106.12423 [cs, stat] | | 2021 |
| On Buggy Resizing Libraries and Surprising Subtleties in FID Calculation | arXiv:2104.11222 [cs] | | 2021 |
Texture
- https://github.com/carson-katri/dream-textures
Tiles
| Title | Venue | Code | Year | 
|---|---|---|---|
| TileGAN: Synthesis of Large-Scale Non-Homogeneous Textures | ACM Transactions on Graphics | | 2019 |
| InsetGAN for Full-Body Image Generation | arXiv:2203.07293 [cs] | | 2022 |
| Collaging Class-Specific GANs for Semantic Image Synthesis | ICCV | | 2021 |
GAN application
| Title | Venue | Code | Year | 
|---|---|---|---|
| SC-FEGAN: Face Editing Generative Adversarial Network with User’s Sketch and Color | arXiv:1902.06838 [cs] | | 2019 |
| Semantic Text-to-Face GAN -ST^2FG | arXiv:2107.10756 [cs] | | 2021 |
| CRD-CGAN: Category-Consistent and Relativistic Constraints for Diverse Text-to-Image Generation | arXiv:2107.13516 [cs] | | 2021 |
Image-to-Image Translation
Style transfer
- https://github.com/nrupatunga/L0-Smoothing
Metric & perceptual loss
Spectrum
| Title | Venue | Code | Year | 
|---|---|---|---|
| Reproducibility of "FDA: Fourier Domain Adaptation for Semantic Segmentation" | arXiv:2104.14749 [cs] | | 2021 |
| A Closer Look at Fourier Spectrum Discrepancies for CNN-Generated Images Detection | CVPR | | 2021 |
Weakly Supervised Object Localization
| Title | Venue | Code | Year | 
|---|---|---|---|
| TS-CAM: Token Semantic Coupled Attention Map for Weakly Supervised Object Localization | arXiv:2103.14862 [cs] | | 2021 |
| Finding an Unsupervised Image Segmenter in Each of Your Deep Generative Models | arXiv:2105.08127 [cs] | | 2021 |
| Segmentation in Style: Unsupervised Semantic Image Segmentation with Stylegan and CLIP | arXiv:2107.12518 [cs] | | 2021 |
Implicit Neural Representations
| Title | Venue | Code | Year | 
|---|---|---|---|
| DeepSDF: Learning Continuous Signed Distance Functions for Shape Representation | arXiv:1901.05103 [cs] | | 2019 |
| Occupancy Networks: Learning 3D Reconstruction in Function Space | arXiv:1812.03828 [cs] | | 2019 |
| :heart: Neural Image Representations for Multi-Image Fusion and Layer Separation | arXiv:2108.01199 [cs] | | 2021 |
| Learning Continuous Image Representation with Local Implicit Image Function | CVPR | | 2021 |
Energy
| Title | Venue | Code | Year | 
|---|---|---|---|
| How to Train Your Energy-Based Models | arXiv:2101.03288 | | 2021 |
| Your Classifier Is Secretly an Energy Based Model and You Should Treat It Like One | ICLR | JEM | 2020 |
| Generative Visual Prompt: Unifying Distributional Control of Pre-Trained Generative Models | NeurIPS | Generative-Visual-Prompt | 2022 | 
Flow
| Title | Venue | Code | Year | 
|---|---|---|---|
| Variational Inference with Normalizing Flows | ICML | | 2015 |
| Density Estimation Using Real NVP | ICLR | | 2017 |
ChatGPT
- https://github.com/golfzert/chatgpt-chinese-prompt-hack
- https://github.com/rawandahmad698/PyChatGPT
Diffusion
- https://github.com/heejkoo/Awesome-Diffusion-Models
- https://github.com/huggingface/diffusers
- https://github.com/Jack000/glid-3-xl
- https://github.com/SirWaffle/AIrtist-k-diffusion-wrap
- https://github.com/altryne/awesome-ai-art-image-synthesis
- https://github.com/YangLing0818/Diffusion-Models-Papers-Survey-Taxonomy
- https://github.com/Jack000/glid-3-xl-stable
- https://github.com/Stability-AI/stablediffusion
Generation
Inversion
| Title | Venue | Code | Year | 
|---|---|---|---|
| ILVR: Conditioning Method for Denoising Diffusion Probabilistic Models | ICCV | ilvr_adm | 2021 |
| Diffusion Models Beat GANs on Image Synthesis | arXiv:2105.05233 [cs, stat] | guided-diffusion | 2021 | 
| An Image Is Worth One Word: Personalizing Text-to-Image Generation Using Textual Inversion | arXiv:2208.01618 | textual_inversion | 2022 | 
| DreamBooth: Fine Tuning Text-to-Image Diffusion Models for Subject-Driven Generation | arXiv:2208.12242 | dreambooth, Dreambooth-Stable-Diffusion | 2022 | 
| DiffusionCLIP: Text-Guided Diffusion Models for Robust Image Manipulation | CVPR | DiffusionCLIP | 2022 | 
Text-to-image
- https://github.com/GeeveGeorge/Stable-Craiyon
Image_to_image
| Title | Venue | Code | Year | 
|---|---|---|---|
| Palette: Image-to-Image Diffusion Models | arXiv:2111.05826 | Palette-Image-to-Image-Diffusion-Models | 2022 | 
| Image Super-Resolution via Iterative Refinement | arXiv:2104.07636 | Image-Super-Resolution-via-Iterative-Refinement | 2021 | 
3D
- https://github.com/neverix/pixel-dreamfusion
| Title | Venue | Code | Year | 
|---|---|---|---|
| RenderDiffusion: Image Diffusion for 3D Reconstruction, Inpainting and Generation | arXiv:2211.09869 | | 2022 |
| Magic3D: High-Resolution Text-to-3D Content Creation | arXiv:2211.10440 | | 2022 |
Detection
| Title | Venue | Code | Year | 
|---|---|---|---|
| DiffusionInst: Diffusion Model for Instance Segmentation | arXiv:2212.02773 | DiffusionInst | 2022 | 
3D & NeRF
- https://www.meshlab.net/
Sine
| Title | Venue | Code | Year | Cite | 
|---|---|---|---|---|
| Fourier Features Let Networks Learn High Frequency Functions in Low Dimensional Domains | arXiv:2006.10739 [cs] | | 2020 | |
| :white_check_mark: Implicit Neural Representations with Periodic Activation Functions | NeurIPS | | 2020 | |
| :white_check_mark: Modulated Periodic Activations for Generalizable Local Functional Representations | arXiv:2104.03960 [cs] | | 2021 | |
| Learned Initializations for Optimizing Coordinate-Based Neural Representations | arXiv:2012.02189 [cs] | nerf-meta | 2021 | |
| Seeing Implicit Neural Representations as Fourier Series | arXiv:2109.00249 [cs] | | 2021 | |
INR
| Title | Venue | Code | Year | Cite | 
|---|---|---|---|---|
| Adversarial Generation of Continuous Images | arXiv:2011.12026 [cs] | inr-gan | 2020 | |
| Image Generators with Conditionally-Independent Pixel Synthesis | arXiv:2011.13775 [cs] | CIPS | 2020 | |
| A Structured Dictionary Perspective on Implicit Neural Representations | arXiv:2112.01917 [cs] | | 2021 | |
3D & NeRF GANs
- https://mrtornado24.github.io/Next3D/
Diffusion
| Title | Venue | Code | Year | Cite | 
|---|---|---|---|---|
| DiffuStereo: High Quality Human Reconstruction via Diffusion-Based Stereo Using Sparse Cameras | ECCV | DiffuStereo | 2022 | |
| DiffRF: Rendering-Guided 3D Radiance Field Diffusion | arXiv:2212.01206 | DiffRF | 2022 | |
NeRF large scene
| Title | Venue | Code | Year | Cite | 
|---|---|---|---|---|
| Mega-NeRF: Scalable Construction of Large-Scale NeRFs for Virtual Fly-Throughs | CVPR | | 2022 | |
| Block-NeRF: Scalable Large Scene Neural View Synthesis | CVPR | BlockNeRFPytorch | 2022 | |
| IBRNet: Learning Multi-View Image-Based Rendering | arXiv:2102.13090 [cs] | IBRNet | 2021 | |
NeRF
- https://github.com/kakaobrain/NeRF-Factory/ :heart:
- https://github.com/openxrlab/xrnerf
- https://github.com/ActiveVisionLab/nerfmm
- https://github.com/ventusff/improved-nerfmm
- https://github.com/Kai-46/nerfplusplus
- https://github.com/kwea123/nerf_pl
- https://github.com/NVlabs/instant-ngp
- https://github.com/sxyu/nerfvis
- https://github.com/frozoul/4K-NeRF
3D inversion
| Title | Venue | Code | Year | Cite | 
|---|---|---|---|---|
| Unsupervised 3D Shape Completion through GAN Inversion | CVPR | | 2021 | |
| 3D GAN Inversion for Controllable Portrait Image Animation | arXiv:2203.13441 [cs] | | 2022 | |
| Pix2NeRF: Unsupervised Conditional $\pi$-GAN for Single Image to Neural Radiance Fields Translation | arXiv:2202.13162 [cs] | | 2022 | |
| Monocular 3D Object Reconstruction with GAN Inversion | ECCV | | 2022 | |
| INeRF: Inverting Neural Radiance Fields for Pose Estimation | IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS) | inerf | 2021 | |
| Shape, Pose, and Appearance from a Single Image via Bootstrapped Radiance Field Inversion | arXiv:2211.11674 | nerf-from-image | 2022 | |
Dynamic
| Title | Venue | Code | Year | Cite | 
|---|---|---|---|---|
| Neural Scene Flow Fields for Space-Time View Synthesis of Dynamic Scenes | arXiv:2011.13084 [cs] | Neural-Scene-Flow-Fields | 2021 | |
| D-NeRF: Neural Radiance Fields for Dynamic Scenes | arXiv:2011.13961 [cs] | D-NeRF | 2020 | |
| Dynamic View Synthesis from Dynamic Monocular Video | arXiv:2105.06468 [cs] | DynamicNeRF | 2021 | |
| :heart: HyperNeRF: A Higher-Dimensional Representation for Topologically Varying Neural Radiance Fields | arXiv:2106.13228 [cs] | hypernerf | 2021 | |
| Neural Radiance Flow for 4D View Synthesis and Video Processing | | | 2020 | |
| :heart: Animatable Neural Implicit Surfaces for Creating Avatars from Videos | arXiv:2203.08133 [cs] | | 2022 | |
Voice
- https://github.com/CorentinJ/Real-Time-Voice-Cloning
Hand
- https://github.com/reyuwei/NIMBLE_model
Hair
- https://github.com/clach/Realtime-Vulkan-Hair
Loose garment
- https://cape.is.tue.mpg.de/dataset.html
| Title | Venue | Code | Year | Cite | 
|---|---|---|---|---|
| :heavy_check_mark: Predicting Loose-Fitting Garment Deformations Using Bone-Driven Motion Networks | SIGGRAPH | VirtualBones | 2022 | |
| TailorNet: Predicting Clothing in 3D as a Function of Human Pose, Shape and Garment Style | CVPR | TailorNet_dataset | 2020 | |
| Learning Implicit Templates for Point-Based Clothed Human Modeling | ECCV | | 2022 | |
| 3D Clothed Human Reconstruction in the Wild | ECCV | ClothWild_RELEASE | 2022 | |
| :heart: TightCap: 3D Human Shape Capture with Clothing Tightness Field | ACM Transactions on Graphics | TightCap | 2021 | |
| ARCH: Animatable Reconstruction of Clothed Humans | CVPR | ARCH | 2020 | |
Rigging
| Title | Venue | Code | Year | Cite | 
|---|---|---|---|---|
| :heart: Learning Skeletal Articulations with Neural Blend Shapes | ACM Transactions on Graphics | neural-blend-shapes | 2021 | |
Anime Body
| Title | Venue | Code | Year | 
|---|---|---|---|
| Collaborative Neural Rendering Using Anime Character Sheets | arXiv:2207.05378 [cs] | CoNR | 2022 |
Body
- https://github.com/3DFaceBody/awesome-3dbody-papers
- https://github.com/openMVG/awesome_3DReconstruction_list
- https://github.com/ytrock/THuman2.0-Dataset
- https://github.com/Danial-Kord/DigiHuman
- https://github.com/zhaofuq/Instant-NSR
Body Generation
- https://github.com/justimyhxu/awesome-3D-generation
Body from video
| Title | Venue | Code | Year | 
|---|---|---|---|
| SelfRecon: Self Reconstruction Your Digital Avatar from Monocular Video | arXiv:2201.12792 [cs] | | 2022 |
3DMM Face
- https://github.com/tencent-ailab/hifi3dface
- https://github.com/ascust/3DMM-Fitting-Pytorch
| Title | Venue | Code | Year | 
|---|---|---|---|
| Neural Head Reenactment with Latent Pose Descriptors | CVPR | latent-pose-reenactment | 2020 | 
| Synergy between 3DMM and 3D Landmarks for Accurate 3D Facial Geometry | arXiv:2110.09772 [cs] | | 2021 |
| REALY: Rethinking the Evaluation of 3D Face Reconstruction | ECCV | REALY | 2022 | 
3D FACE Avatars
- https://github.com/TimoBolkart/BFM_to_FLAME
- https://github.com/HavenFeng/photometric_optimization
- https://github.com/soubhiksanyal/FLAME_PyTorch
- https://github.com/Azmarie/Face-Morphing
| Title | Venue | Code | Year | 
|---|---|---|---|
| A Morphable Model for the Synthesis of 3D Faces | Proceedings of the 26th Annual Conference on Computer Graphics and Interactive Techniques (SIGGRAPH ’99), ACM Press/Addison-Wesley Publishing Co. | | 1999 |
| Learning a Model of Facial Shape and Expression from 4D Scans | ACM Transactions on Graphics | FLAME | 2017 |
| :heart: FLAME-in-NeRF : Neural Control of Radiance Fields for Free View Face Animation | arXiv:2108.04913 [cs] | | 2021 |
| Learning a Model of Facial Shape and Expression from 4D Scans | ACM Transactions on Graphics | | 2017 |
| :heart: EMOCA: Emotion Driven Monocular Face Capture and Animation | CVPR | emoca | 2022 | 
| FaceVerse: A Fine-Grained and Detail-Controllable 3D Face Morphable Model from a Hybrid Dataset | CVPR | | 2022 |
| I M Avatar: Implicit Morphable Head Avatars from Videos | CVPR | IMavatar | 2022 | 
| :heavy_check_mark: Neural Head Avatars from Monocular RGB Videos | arXiv:2112.01554 [cs] | neural-head-avatars | 2022 | 
| PVA: Pixel-Aligned Volumetric Avatars | arXiv:2101.02697 [cs] | | 2021 |
| AD-NeRF: Audio Driven Neural Radiance Fields for Talking Head Synthesis | arXiv:2103.11078 [cs] | | 2021 |
| Semantic-Aware Implicit Neural Audio-Driven Video Portrait Generation | arXiv:2201.07786 [cs, eess] | | 2022 |
| HeadGAN: One-Shot Neural Head Synthesis and Editing | arXiv:2012.08261 [cs] | | 2021 |
| KeypointNeRF: Generalizing Image-Based Volumetric Avatars Using Relative Spatial Encoding of Keypoints | arXiv:2205.04992 [cs] | | 2022 |
| Accurate 3D Face Reconstruction with Weakly-Supervised Learning: From Single Image to Image Set | arXiv:1903.08527 [cs] | Deep3DFaceRecon_pytorch | 2020 |
Stylization
| Title | Venue | Code | Year | 
|---|---|---|---|
| Unified Implicit Neural Stylization | ECCV | | 2022 |
| ARF: Artistic Radiance Fields | ECCV | ARF-svox2 | 2022 |
| UPST-NeRF: Universal Photorealistic Style Transfer of Neural Radiance Fields for 3D Scene | arXiv:2208.07059 | UPST-NeRF | 2022 | 
Face Style
| Title | Venue | Code | Year | 
|---|---|---|---|
| Pastiche Master: Exemplar-Based High-Resolution Portrait Style Transfer | arXiv:2203.13248 [cs] | DualStyleGAN | 2022 | 
| Stitch It in Time: GAN-Based Facial Editing of Real Videos | arXiv | STIT | 2022 |
| Fix the Noise: Disentangling Source Feature for Transfer Learning of StyleGAN | arXiv:2204.14079 [cs] | FixNoise | 2022 |
| AnimeCeleb: Large-Scale Animation CelebHeads Dataset for Head Reenactment | ECCV | AnimeCeleb | 2022 |
| DCT-Net: Domain-Calibrated Translation for Portrait Stylization | ACM Transactions on Graphics | DCT-Net | 2022 | 
| VToonify: Controllable High-Resolution Portrait Video Style Transfer | ACM Transactions on Graphics (TOG) | VToonify | n.d. | 
| BlendGAN: Implicitly GAN Blending for Arbitrary Stylized Face Generation | NeurIPS | BlendGAN | 2021 | 
| Unpaired Cartoon Image Synthesis via Gated Cycle Mapping | CVPR | | 2022 |
Face Animation
| Title | Venue | Code | Year | 
|---|---|---|---|
| Thin-Plate Spline Motion Model for Image Animation | CVPR | | 2022 |
| Depth-Aware Generative Adversarial Network for Talking Head Video Generation | CVPR | DaGAN | 2022 |
Renderer & Regularization
- https://github.com/ventusff/neurecon
Material and lighting
| Title | Venue | Code | Year | 
|---|---|---|---|
| NeILF: Neural Incident Light Field for Physically-Based Material Estimation | ECCV | neilf | 2022 |
| NeRF for Outdoor Scene Relighting | ECCV | NeRF-OSR | 2022 |
Motion
- https://github.com/xianfei/SysMocap
- https://github.com/zju3dv/EasyMocap
- https://github.com/EricGuo5513/HumanML3D
| Title | Venue | Code | Year | 
|---|---|---|---|
| GANimator: Neural Motion Synthesis from a Single Sequence | ACM Transactions on Graphics (TOG) | ganimator | 2022 | 
| Watch It Move: Unsupervised Discovery of 3D Joints for Re-Posing of Articulated Objects | CVPR | watch-it-move | 2022 |
| Learn to Dance with AIST++: Music Conditioned 3D Dance Generation | ICCV | | 2021 |
| Talking Head(?) Anime from a Single Image 3: Now the Body Too | | talking-head-anime | 2022 |
| PhysCap: Physically Plausible Monocular 3D Motion Capture in Real Time | ACM Transactions on Graphics | | 2020 |
| The Wanderings of Odysseus in 3D Scenes | CVPR | GAMMA | 2022 |
| Adversarial Parametric Pose Prior | CVPR | adv_param_pose_prior | 2022 |
| AvatarCLIP: Zero-Shot Text-Driven Generation and Animation of 3D Avatars | SIGGRAPH | AvatarCLIP | 2022 | 
| SOMA: Solving Optical Marker-Based MoCap Automatically | ICCV | soma | 2021 |
| MotionDiffuse: Text-Driven Human Motion Generation with Diffusion Model | arXiv:2208.15001 | MotionDiffuse | 2022 | 
| TEACH: Temporal Action Composition for 3D Humans | International Conference on 3D Vision (3DV) | teach | 2022 |
| TM2T: Stochastic and Tokenized Modeling for the Reciprocal Generation of 3D Human Motions and Texts | ECCV | TM2T | 2022 | 
Shape generation
| Title | Venue | Code | Year | 
|---|---|---|---|
| Learning Implicit Fields for Generative Shape Modeling | arXiv:1812.02822 [cs] | | 2019 |
SMPL estimation
- https://github.com/open-mmlab/mmhuman3d
| Title | Venue | Code | Year | 
|---|---|---|---|
| End-to-End Recovery of Human Shape and Pose | CVPR | hmr | 2018 |
| VIBE: Video Inference for Human Body Pose and Shape Estimation | CVPR | VIBE | 2020 |
| TransPose: Real-Time 3D Human Translation and Pose Estimation with Six Inertial Sensors | ACM Transactions on Graphics | TransPose | 2021 | 
| Monocular Expressive Body Regression through Body-Driven Attention | European Conference on Computer Vision (ECCV) | expose | 2020 | 
| Human Mesh Recovery from Multiple Shots | CVPR | multishot | 2022 |
| :heart: Learned Vertex Descent: A New Direction for 3D Human Model Fitting | ECCV | LVD | 2022 |
| DeciWatch: A Simple Baseline for 10x Efficient 2D and 3D Pose Estimation | ECCV | DeciWatch | 2022 |
| PARE: Part Attention Regressor for 3D Human Body Estimation | ICCV | PARE | 2021 |
| Cross-Attention of Disentangled Modalities for 3D Human Mesh Recovery with Transformers | ECCV | FastMETRO | 2022 | 
Segmentation
- https://github.com/facebookresearch/MaskFormer
| Title | Venue | Code | Year | 
|---|---|---|---|
| Real-Time High-Resolution Background Matting | arXiv:2012.07810 | BackgroundMattingV2 | 2020 | 
| Robust High-Resolution Video Matting with Temporal Guidance | arXiv:2108.11515 [cs] | RobustVideoMatting | 2021 |
Datasets
- https://github.com/karfly/human36m-camera-parameters
- https://github.com/deepimagination/TalkingHead-1KH
| Title | Venue | Code | Year | 
|---|---|---|---|
| Structured Local Radiance Fields for Human Avatar Modeling | CVPR | THUman4.0-Dataset | 2022 | 
| Multiface: A Dataset for Neural Face Rendering | arXiv:2207.11243 [cs.CV] | multiface | 2022 |
| ImFace: A Nonlinear 3D Morphable Face Model with Implicit Neural Representations | CVPR | ImFace | 2022 | 
FLAME estimation
| Title | Venue | Code | Year | 
|---|---|---|---|
| Towards Metrical Reconstruction of Human Faces | ECCV | MICA | 2022 |
Dog estimation
| Title | Venue | Code | Year | 
|---|---|---|---|
| BARC: Learning to Regress 3D Dog Shape from Images by Exploiting Breed Information | CVPR | barc_release | 2022 |
Panoptic
| Title | Venue | Code | Year | 
|---|---|---|---|
| Panoptic NeRF: 3D-to-2D Label Transfer for Panoptic Urban Scene Segmentation | arXiv:2203.15224 [cs] | PanopticNeRF | 2022 |
SDF
- https://github.com/facebookresearch/pifuhd
- https://github.com/pmneila/PyMCubes
| Title | Venue | Code | Year | 
|---|---|---|---|
| :white_check_mark: DeepSDF: Learning Continuous Signed Distance Functions for Shape Representation | arXiv:1901.05103 [cs] | DeepSDF | 2019 | 
| Learning a Probabilistic Latent Space of Object Shapes via 3D Generative-Adversarial Modeling | NeurIPS | | 2016 |
| Occupancy Networks: Learning 3D Reconstruction in Function Space | arXiv:1812.03828 [cs] | | 2019 |
| PIFu: Pixel-Aligned Implicit Function for High-Resolution Clothed Human Digitization | arXiv:1905.05172 [cs] | | 2019 |
| Deep Meta Functionals for Shape Representation | arXiv:1908.06277 [cs] | | 2019 |
3D
| Title | Venue | Code | Year | 
|---|---|---|---|
| Escaping Plato’s Cave: 3D Shape From Adversarial Rendering | ICCV | | 2019 |
| StyleRig: Rigging StyleGAN for 3D Control over Portrait Images | arXiv:2004.00121 [cs] | | 2020 |
| Exemplar-Based 3D Portrait Stylization | arXiv:2104.14559 [cs] | github | 2021 | 
| :heart: Landmark Detection and 3D Face Reconstruction for Caricature Using a Nonlinear Parametric Model | arXiv:2004.09190 [cs] | CaricatureFace | 2021 | 
| SofGAN: A Portrait Image Generator with Dynamic Styling | arXiv:2007.03780 [cs] | sofgan | 2021 | 
| :heart: FreeStyleGAN: Free-View Editable Portrait Rendering with the Camera Manifold | arXiv:2109.09378 [cs] | | 2021 |
| PIRenderer: Controllable Portrait Image Generation via Semantic Neural Rendering | ICCV | PIRender | 2021 | 
Point Cloud
| Title | Venue | Code | Year | 
|---|---|---|---|
| Point-Based Modeling of Human Clothing | ICCV | | 2021 |
| ADOP: Approximate Differentiable One-Pixel Point Rendering | arXiv:2110.06635 [cs] | | 2021 |
Stylization
| Title | Venue | Code | Year | 
|---|---|---|---|
| Learning to Stylize Novel Views | arXiv:2105.13509 [cs] | stylescene | 2021 | 
Datasets
- https://github.com/ofirkris/Faces-datasets
| Title | Venue | Code | Year | 
|---|---|---|---|
| Common Objects in 3D: Large-Scale Learning and Evaluation of Real-Life 3D Category Reconstruction | ICCV | | 2021 |
| A 3D Face Model for Pose and Illumination Invariant Face Recognition | IEEE International Conference on Advanced Video and Signal Based Surveillance | BFM | 2009 | 
| SfSNet: Learning Shape, Reflectance and Illuminance of Faces in the Wild | arXiv:1712.01261 [cs] | | 2018 |
3D-aware image synthesis (ref)
| Title | Venue | Code | Year | 
|---|---|---|---|
| Visual Object Networks: Image Generation with Disentangled 3D Representation | arXiv:1812.02725 [cs, stat] | | 2018 |
| Escaping Plato’s Cave: 3D Shape From Adversarial Rendering | ICCV | | 2019 |
| HoloGAN: Unsupervised Learning of 3D Representations from Natural Images | ICCV | | 2019 |
Face
Tools
- https://github.com/wuhuikai/FaceSwap
- https://github.com/hysts/anime-face-detector
- https://github.com/qq775193759/3D-CariGAN
- https://github.com/yeemachine/kalidokit
- https://github.com/sicxu/Deep3DFaceRecon_pytorch
- https://github.com/happy-jihye/face-vid2vid-demo
Edit
| Title | Venue | Code | Year | 
|---|---|---|---|
| FaceEraser: Removing Facial Parts for Augmented Reality | arXiv:2109.10760 [cs] | | 2021 |
| DyStyle: Dynamic Neural Network for Multi-Attribute-Conditioned Style Editing | arXiv:2109.10737 [cs] | | 2021 |
| :heart: StyleGAN-NADA: CLIP-Guided Domain Adaptation of Image Generators | arXiv:2108.00946 [cs] | | 2021 |
| Beholder-GAN: Generation and Beautification of Facial Images with Conditioning on Their Beauty Level | arXiv:1902.02593 [cs] | | 2019 |
| Mind the Gap: Domain Gap Control for Single Shot Domain Adaptation for Generative Adversarial Networks | arXiv:2110.08398 [cs] | | 2021 |
| Fine-Grained Control of Artistic Styles in Image Generation | arXiv:2110.10278 [cs] | | 2021 |
Anime Face
- https://github.com/Sxela/ArcaneGAN
- https://github.com/mchong6/GANsNRoses
- https://github.com/FilipAndersson245/cartoon-gan
- https://github.com/venture-anime/cartoongan-pytorch
| Title | Venue | Code | Year | 
|---|---|---|---|
| AniGAN: Style-Guided Generative Adversarial Networks for Unsupervised Anime Face Generation | arXiv:2102.12593 [cs] | | 2021 |
| AnimeGAN: A Novel Lightweight GAN for Photo Animation | | AnimeGANv2 | 2020 |
| :heart: Learning to Cartoonize Using White-Box Cartoon Representations | CVPR | White-box-Cartoonization | 2020 | 
| Generative Adversarial Networks for Photo to Hayao Miyazaki Style Cartoons | arXiv:2005.07702 [cs, eess] | | 2020 |
3DMM
- https://github.com/lattas/AvatarMe
| Title | Venue | Code | Year | 
|---|---|---|---|
| A Morphable Model for the Synthesis of 3D Faces | Proceedings of the 26th Annual Conference on Computer Graphics and Interactive Techniques (SIGGRAPH ’99), ACM Press/Addison-Wesley Publishing Co. | 3DMM | 1999 |
Face
| Title | Venue | Code | Year | 
|---|---|---|---|
| SketchHairSalon: Deep Sketch-Based Hair Image Synthesis | arXiv:2109.07874 [cs] | | 2021 |
Face Alignment
| Title | Venue | Code | Year | 
|---|---|---|---|
| Face Alignment Across Large Poses: A 3D Solution | IEEE Transactions on Pattern Analysis and Machine Intelligence | | 2019 |
Face Recognition
| Title | Venue | Code | Year | 
|---|---|---|---|
| High-Fidelity Pose and Expression Normalization for Face Recognition in the Wild | 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) | | 2015 |
Face swapping
- https://github.com/mindslab-ai/hififace
3D
DA
| Title | Venue | Code | Year | 
|---|---|---|---|
| Semi-Supervised Domain Adaptation via Adaptive and Progressive Feature Alignment | arXiv:2106.02845 [cs] | | 2021 |
| Prototypical Pseudo Label Denoising and Target Structure Learning for Domain Adaptive Semantic Segmentation | arXiv:2101.10979 [cs] | | 2021 |
Data
- https://github.com/koaning/doubtlab
CNN & BN
Light architecture
- https://github.com/yoshitomo-matsubara/torchdistill
- https://github.com/milesial/Pytorch-UNet
| Title | Venue | Code | Year | 
|---|---|---|---|
| Network Augmentation for Tiny Deep Learning | arXiv:2110.08890 [cs] | | 2021 |
| Non-Deep Networks | arXiv:2110.07641 [cs] | | 2021 |
| When to Prune? A Policy towards Early Structural Pruning | arXiv:2110.12007 [cs] | | 2021 |
| :heart: ConformalLayers: A Non-Linear Sequential Neural Network with Associative Layers | arXiv:2110.12108 [cs] | | 2021 |
| CHIP: CHannel Independence-Based Pruning for Compact Neural Networks | arXiv:2110.13981 [cs] | | 2021 |
| Do We Actually Need Dense Over-Parameterization? In-Time Over-Parameterization in Sparse Training | arXiv:2102.02887 [cs] | | 2021 |
Antialiased CNNs
| Title | Venue | Code | Year | 
|---|---|---|---|
| Making Convolutional Networks Shift-Invariant Again | arXiv:1904.11486 [cs] | | 2019 |
| Group Equivariant Convolutional Networks | ICML, arXiv | | 2016 |
| Harmonic Networks: Deep Translation and Rotation Equivariance | CVPR, arXiv | | 2017 |
| Learning Steerable Filters for Rotation Equivariant CNNs | CVPR, arXiv | | 2018 |
Architecture
Compression
| Title | Venue | Code | Year | 
|---|---|---|---|
| AdaPruner: Adaptive Channel Pruning and Effective Weights Inheritance | arXiv:2109.06397 [cs] | | 2021 |
Detection
| Title | Venue | Code | Year | 
|---|---|---|---|
| Anchor DETR: Query Design for Transformer-Based Detector | arXiv:2109.07107 [cs] | | 2021 |
| :heart: Detecting Twenty-Thousand Classes Using Image-Level Supervision | arXiv:2201.02605 [cs] | | 2022 |
Segmentation
- https://github.com/xuebinqin/U-2-Net#usage-for-portrait-generation
| Title | Venue | Code | Year | 
|---|---|---|---|
| Robust High-Resolution Video Matting with Temporal Guidance | arXiv:2108.11515 [cs.CV] | | 2021 |
MLP
| Title | Venue | Code | Year | 
|---|---|---|---|
| ResMLP: Feedforward Networks for Image Classification with Data-Efficient Training | arXiv:2105.03404 [cs] | | 2021 |
| ConvMLP: Hierarchical Convolutional MLPs for Vision | arXiv:2109.04454 [cs] | | 2021 |
| A Battle of Network Structures: An Empirical Study of CNN, Transformer, and MLP | arXiv:2108.13002 [cs.CV] | | 2021 |
| Sparse-MLP: A Fully-MLP Architecture with Conditional Computation | arXiv:2109.02008 [cs] | | 2021 |
| MLP-Mixer: An All-MLP Architecture for Vision | | | 2021 |
| CycleMLP: A MLP-like Architecture for Dense Prediction | ICLR | | 2022 |
Transformer
- https://github.com/xxxnell/how-do-vits-work
- https://github.com/hamidkazemi22/vit-visualization
SSL
- https://github.com/ucasligang/awesome-MIM
Finetune
| Title | Venue | Code | Year | 
|---|---|---|---|
| :heart: How Transferable Are Features in Deep Neural Networks? | arXiv:1411.1792 [cs] | | 2014 |
Positional Encoding
| Title | Venue | Code | Year | 
|---|---|---|---|
| Positional Encoding as Spatial Inductive Bias in GANs | arXiv:2012.05217 [cs] | | 2020 |
| Mind the Pad -- CNNs Can Develop Blind Spots | arXiv:2010.02178 [cs, stat] | | 2020 |
| :heart: How Much Position Information Do Convolutional Neural Networks Encode? | ICLR | | 2020 |
| On Translation Invariance in CNNs: Convolutional Layers Can Exploit Absolute Spatial Location | CVPR | | 2020 |
| Rethinking and Improving Relative Position Encoding for Vision Transformer | ICCV | | 2021 |
| A Structured Dictionary Perspective on Implicit Neural Representations | arXiv:2112.01917 [cs] | | 2021 |
NAS
NAS cls
| Title | Venue | Code | Year | 
|---|---|---|---|
| Neural Architecture Search with Reinforcement Learning | ICLR | | 2017 |
| Learning Transferable Architectures for Scalable Image Recognition | CVPR | | 2018 |
| Progressive Neural Architecture Search | ECCV | | 2018 |
| Efficient Neural Architecture Search via Parameter Sharing | ICML | | 2018 |
| MnasNet: Platform-Aware Neural Architecture Search for Mobile | CVPR | | 2019 |
| DARTS: Differentiable Architecture Search | ICLR | | 2019 |
NAS GAN
| Title | Venue | Code | Year | 
|---|---|---|---|
| AlphaGAN: Fully Differentiable Architecture Search for Generative Adversarial Networks | IEEE Transactions on Pattern Analysis and Machine Intelligence | | 2021 |
| GAN Compression: Efficient Architectures for Interactive Conditional GANs | CVPR | | 2020 |
| Off-Policy Reinforcement Learning for Efficient and Effective GAN Architecture Search | ECCV | | 2020 |
| AutoGAN-Distiller: Searching to Compress Generative Adversarial Networks | ICML | | 2020 |
| A Multi-Objective Architecture Search for Generative Adversarial Networks | | | 2020 |
| AutoGAN: Neural Architecture Search for Generative Adversarial Networks | ICCV | | 2019 |
Low-level
Super-resolution
- https://github.com/nihui/realsr-ncnn-vulkan
Frame Interpolation
| Title | Venue | Code | Year | 
|---|---|---|---|
| FILM: Frame Interpolation for Large Motion | arXiv:2202.04901 [cs] | | 2022 |
Denoising
| Title | Venue | Code | Year | 
|---|---|---|---|
| Image Denoising by Sparse 3-D Transform-Domain Collaborative Filtering | IEEE Transactions on Image Processing | | 2007 |
| Towards Flexible Blind JPEG Artifacts Removal | arXiv:2109.14573 [cs, eess] | FBCNN | 2021 | 
Scholar
- https://github.com/tangjiapeng
- Fisher Yu