posterior-collapse-list
posterior-collapse-list copied to clipboard
A curated list of techniques to avoid posterior collapse
Lots of papers have been trying to address the problem of posterior collapse with VAEs. Due to huge number of publications, I thought it is intersting to have a list of related papers.
| Paper | Implementation | Publication Year | Citation |
|---|---|---|---|
| Ladder variational autoencoders | Yes | 2016 | 200+ |
| Fixing a broken ELBO | No | 2017 | 70+ |
| Neural discrete representation learning | Third Party | 2017 | 128+ |
| Tackling Over-pruning in Variational Autoencoders | No | 2017 | 20+ |
| Filtering Variational Objectives | Yes | 2017 | 60+ |
| Auxiliary Guided Autoregressive Variational Autoencoders | Third Party | 2017 | 10- |
| VAE with a VampPrior | Yes | 2017 | 70+ |
| Z-Forcing: Training Stochastic Recurrent Networks | Third Party | 2017 | 40+ |
| Latent Space Optimal Transport for Generative Models | No | 2018 | 10- |
| Improving explorability in variational inference with annealed variational objectives | No | 2018 | 10- |
| Taming VAEs | No | 2018 | 10+ |
| Semi-Amortized Variational Autoencoders | No | 2018 | 40+ |
| Avoiding Latent Variable Collapse with Generative Skip Models | No | 2018 | 15+ |
| Spherical Latent Spaces for Stable Variational Autoencoders | Yes | 2018 | 10+ |
| Unsupervised Discrete Sentence Representation Learning for Interpretable Neural Dialog Generation | No | 2018 | 10+ |
| Learning Latent Representations For Style Control And Transfer in End-To-End Speech Synthesis | Third Party | 2018 | 10- |
| The Mutual Autoencoder: Controlling Information in Latent Code Representations | No | 2018 | 10- |
| Hierarchicaly-Structured Variational Autoencoder For Long Text Generation | No | 2018 | 10- |
| Iterative Amortized Inference | No | 2018 | 20+ |
| BIVA: A very deep hierarchy of latent variables for generative modeling | No | 2019 | 10- |
| preventing posterior collapse with delta-VAEs | No | 2019 | 10- |
| Diagnosing and enhancing VAE model | No | 2019 | 10- |
| MAE: Mutual Posterior-Divergence Regularization For Variational Auto Encoder | No | 2019 | 10- |
| Topic-Guided Variational Autoencoders for Text Generation | No | 2019 | 10- |
| Importance Weighted Hierarchical Variational Inference | No | 2019 | 10- |
| Generated Loss, Augmented Training, And Multiscale VAE | No | 2019 | 40+ |
| mμ-Forcing: Training Variational Recurrent Autoencoders for Text Generation | No | 2019 | 10- |
| Dueling Decoders: Regularizing Variational Autoencoder Latent Spaces | No | 2019 | 10- |
| Riemannian Normalizing Flow on Variational Wasserstein Autoencoder for Text Modeling | No | 2019 | 10- |
| LIA: Latently Invertible Autoencoder with Adversarial Learning | Yes | 2019 | 10- |
| Compound Variational Auto-Encoder | No | 2019 | 10- |
| Quantization-Based Regularization for Autoencoders | No | 2019 | 10- |
| Lagging Ingerence Network And Posterior Collapse In Variational Autoencoders | Yes | 2019 | 10+ |
| Cyclical Annealing Schedule: A Simple Approach to Mitigating KL Vanishing | No | 2019 | 10- |
| Understanding Posterior Collapse in Generative Latent Variable Models | No | 2019 | 10- |
Some papers observed posterior collapse for a particular task and tried to alleviate it mostly by KL-annealing:
| Paper | Implementation | Publication Year | Citation |
|---|---|---|---|
| Generating Sentences from a Continuous Space | Third Party | 2015 | 600+ |
| A Neural Representation of Sketch Drawings | No | 2017 | 150+ |
| Improved Variational Autoencoders for Text Modeling using Dilated Convolutions | Third Party | 2017 | 80+ |
| Diverse and Accurate Image Description Using a Variational Auto-Encoder with an Additive Gaussian Encoding Space | Thrid Party | 2017 | 40+ |
| A Hierarchical Latent Vector Model for Learning Long-Term Structure in Music | Yes | 2018 | 50+ |
| Improving Variational Encoder-Decoders in Dialogue Generation | Third Party | 2018 | 50 |
| The challenge of realistic music generation: modelling raw audio at scale | No | 2018 | 20+ |
| Learning Product Codebooks Using Vector-Quantized Autoencoder For Image Retrieval | Yes | 2018 | 20+ |
| Auto-Encoding Variational Neural Machine Translation | No | 2018 | 20+ |
| Trajectory-User Linking via Variational AutoEncoder | No | 2018 | 20+ |
| Structure-aware Generative Network for 3D-Shape Modeling | No | 2018 | 20+ |
| Auto-Encoding Variational Neural Machine Translation | No | 2018 | 20+ |
| Unsupervised speech representation learning using WaveNet autoencoders | No | 2019 | 10- |
| Syntax-Infused Variational Autoencoder for Text Generation | No | 2019 | 10- |
| Unsupervised Recurrent Neural Network Grammars | No | 2019 | 10- |
| Learning Latent Plans from Play | No | 2019 | 10- |
| DialogWAE: Multimodal Response Generation With Conditional Wasserstein Auto-Encoder | No | 2019 | 10- |
Highly related papers but not exactly on Posterior Collapse:
| Paper | Implementation | Publication Year | Citation |
|---|---|---|---|
| Adversarial Autoencoders | Third Party | 2015 | 750+ |
| Improved variational inference with inverse autoregressive flow | Yes | 2016 | 400+ |
| Stick-Breaking VAE | No | 2016 | 40+ |
| Variational lossy autoencoder | No | 2016 | 190+ |
| Symmetrized Variational Inference | No | 2016 | 10- |
| ELBO surgery: yet another way to carve up the variational evidence lower bound | No | 2017 | 70+ |
| Adversarially Regularized Autoencoders | Yes | 2018 | 60+ |
| Adversarial Symmetric Variational Autoencoder | No | 2017 | 25+ |
| beta-VAE: Learning basic visual concepts with a constrained variational framework | Third Party | 2017 | 350+ |
| Distribution matching in variational inference | Third Party | 2017 | 15+ |
| Learning Latent Representations for Style Ccontrol And Transfer In end-to-end Speech Synthesis | Third Party | 2018 | 50+ |
| Wassertain Auto-Encoder: Latent Dimentionality And Random Encoders | No | 2018 | 10- |
| Learning Deep Representation by Mutual Information Estimation And Maximization | No | 2018 | 20+ |
| Sinkhorn AutoEncoders | No | 2018 | 10- |
| Learning Priors for Adversarial Autoencoders | No | 2018 | 10- |
| Hyperspherical Variational Auto-Encoders | Yes | 2018 | 30+ |
| Universal Audio Synthesizer Control With Normalizing Flows | Yes | 2018 | 10- |
| Representation Learning with Contrastive Predictive Coding | Third Party | 2018 | 50+ |