posterior-collapse-list
                                
A curated list of techniques to avoid posterior collapse
Many papers have tried to address the problem of posterior collapse in VAEs. Given the large number of publications, I thought it would be useful to keep a list of related papers.
| Paper | Implementation | Publication Year | Citation | 
|---|---|---|---|
| Ladder variational autoencoders | Yes | 2016 | 200+ | 
| Fixing a broken ELBO | No | 2017 | 70+ | 
| Neural discrete representation learning | Third Party | 2017 | 128+ | 
| Tackling Over-pruning in Variational Autoencoders | No | 2017 | 20+ | 
| Filtering Variational Objectives | Yes | 2017 | 60+ | 
| Auxiliary Guided Autoregressive Variational Autoencoders | Third Party | 2017 | 10- | 
| VAE with a VampPrior | Yes | 2017 | 70+ | 
| Z-Forcing: Training Stochastic Recurrent Networks | Third Party | 2017 | 40+ | 
| Latent Space Optimal Transport for Generative Models | No | 2018 | 10- | 
| Improving explorability in variational inference with annealed variational objectives | No | 2018 | 10- | 
| Taming VAEs | No | 2018 | 10+ | 
| Semi-Amortized Variational Autoencoders | No | 2018 | 40+ | 
| Avoiding Latent Variable Collapse with Generative Skip Models | No | 2018 | 15+ | 
| Spherical Latent Spaces for Stable Variational Autoencoders | Yes | 2018 | 10+ | 
| Unsupervised Discrete Sentence Representation Learning for Interpretable Neural Dialog Generation | No | 2018 | 10+ | 
| Learning Latent Representations For Style Control And Transfer in End-To-End Speech Synthesis | Third Party | 2018 | 10- | 
| The Mutual Autoencoder: Controlling Information in Latent Code Representations | No | 2018 | 10- | 
| Hierarchically-Structured Variational Autoencoder For Long Text Generation | No | 2018 | 10- | 
| Iterative Amortized Inference | No | 2018 | 20+ | 
| BIVA: A very deep hierarchy of latent variables for generative modeling | No | 2019 | 10- | 
| Preventing Posterior Collapse with delta-VAEs | No | 2019 | 10- | 
| Diagnosing and Enhancing VAE Models | No | 2019 | 10- | 
| MAE: Mutual Posterior-Divergence Regularization For Variational Auto Encoder | No | 2019 | 10- | 
| Topic-Guided Variational Autoencoders for Text Generation | No | 2019 | 10- | 
| Importance Weighted Hierarchical Variational Inference | No | 2019 | 10- | 
| Generated Loss, Augmented Training, And Multiscale VAE | No | 2019 | 40+ | 
| μ-Forcing: Training Variational Recurrent Autoencoders for Text Generation | No | 2019 | 10- | 
| Dueling Decoders: Regularizing Variational Autoencoder Latent Spaces | No | 2019 | 10- | 
| Riemannian Normalizing Flow on Variational Wasserstein Autoencoder for Text Modeling | No | 2019 | 10- | 
| LIA: Latently Invertible Autoencoder with Adversarial Learning | Yes | 2019 | 10- | 
| Compound Variational Auto-Encoder | No | 2019 | 10- | 
| Quantization-Based Regularization for Autoencoders | No | 2019 | 10- | 
| Lagging Inference Networks And Posterior Collapse In Variational Autoencoders | Yes | 2019 | 10+ | 
| Cyclical Annealing Schedule: A Simple Approach to Mitigating KL Vanishing | No | 2019 | 10- | 
| Understanding Posterior Collapse in Generative Latent Variable Models | No | 2019 | 10- | 
Some papers observed posterior collapse in a particular task and tried to alleviate it, mostly via KL annealing:
| Paper | Implementation | Publication Year | Citation | 
|---|---|---|---|
| Generating Sentences from a Continuous Space | Third Party | 2015 | 600+ | 
| A Neural Representation of Sketch Drawings | No | 2017 | 150+ | 
| Improved Variational Autoencoders for Text Modeling using Dilated Convolutions | Third Party | 2017 | 80+ | 
| Diverse and Accurate Image Description Using a Variational Auto-Encoder with an Additive Gaussian Encoding Space | Third Party | 2017 | 40+ | 
| A Hierarchical Latent Vector Model for Learning Long-Term Structure in Music | Yes | 2018 | 50+ | 
| Improving Variational Encoder-Decoders in Dialogue Generation | Third Party | 2018 | 50 | 
| The challenge of realistic music generation: modelling raw audio at scale | No | 2018 | 20+ | 
| Learning Product Codebooks Using Vector-Quantized Autoencoder For Image Retrieval | Yes | 2018 | 20+ | 
| Auto-Encoding Variational Neural Machine Translation | No | 2018 | 20+ | 
| Trajectory-User Linking via Variational AutoEncoder | No | 2018 | 20+ | 
| Structure-aware Generative Network for 3D-Shape Modeling | No | 2018 | 20+ | 
| Unsupervised speech representation learning using WaveNet autoencoders | No | 2019 | 10- | 
| Syntax-Infused Variational Autoencoder for Text Generation | No | 2019 | 10- | 
| Unsupervised Recurrent Neural Network Grammars | No | 2019 | 10- | 
| Learning Latent Plans from Play | No | 2019 | 10- | 
| DialogWAE: Multimodal Response Generation With Conditional Wasserstein Auto-Encoder | No | 2019 | 10- | 
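
As a rough illustration of the KL-annealing idea that most of the papers in the table above rely on, here is a minimal sketch. It assumes a simple linear warm-up of the KL weight from 0 to 1; the function names, the `warmup_steps` parameter, and the schedule shape are hypothetical choices for illustration and are not taken from any specific paper in the list.

```python
def linear_kl_weight(step, warmup_steps, max_weight=1.0):
    """Ramp the KL weight linearly from 0 to max_weight over warmup_steps.

    Assumption: a plain linear schedule; individual papers may use
    sigmoid, cyclical, or other schedules instead.
    """
    if warmup_steps <= 0:
        return max_weight
    return max_weight * min(1.0, step / warmup_steps)


def annealed_loss(reconstruction_loss, kl_divergence, step, warmup_steps):
    """Negative ELBO with the KL term scaled by the annealing weight."""
    weight = linear_kl_weight(step, warmup_steps)
    return reconstruction_loss + weight * kl_divergence


if __name__ == "__main__":
    # Print the weight at a few training steps for a 100-step warm-up.
    for step in (0, 25, 50, 100, 200):
        print(step, linear_kl_weight(step, warmup_steps=100))
```

Early in training the KL term is almost ignored, so the decoder is pushed to use the latent code before the prior-matching penalty reaches full strength. After `warmup_steps` the objective is the usual negative ELBO.
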
Papers that are highly related but not specifically about posterior collapse:
| Paper | Implementation | Publication Year | Citation | 
|---|---|---|---|
| Adversarial Autoencoders | Third Party | 2015 | 750+ | 
| Improved variational inference with inverse autoregressive flow | Yes | 2016 | 400+ | 
| Stick-Breaking VAE | No | 2016 | 40+ | 
| Variational lossy autoencoder | No | 2016 | 190+ | 
| Symmetrized Variational Inference | No | 2016 | 10- | 
| ELBO surgery: yet another way to carve up the variational evidence lower bound | No | 2017 | 70+ | 
| Adversarially Regularized Autoencoders | Yes | 2018 | 60+ | 
| Adversarial Symmetric Variational Autoencoder | No | 2017 | 25+ | 
| beta-VAE: Learning basic visual concepts with a constrained variational framework | Third Party | 2017 | 350+ | 
| Distribution matching in variational inference | Third Party | 2017 | 15+ | 
| Learning Latent Representations for Style Control And Transfer In End-to-End Speech Synthesis | Third Party | 2018 | 50+ | 
| Wasserstein Auto-Encoders: Latent Dimensionality And Random Encoders | No | 2018 | 10- | 
| Learning Deep Representations by Mutual Information Estimation And Maximization | No | 2018 | 20+ | 
| Sinkhorn AutoEncoders | No | 2018 | 10- | 
| Learning Priors for Adversarial Autoencoders | No | 2018 | 10- | 
| Hyperspherical Variational Auto-Encoders | Yes | 2018 | 30+ | 
| Universal Audio Synthesizer Control With Normalizing Flows | Yes | 2018 | 10- | 
| Representation Learning with Contrastive Predictive Coding | Third Party | 2018 | 50+ |