VQ-VAE with discrete flows

Open MichelPezzat opened this issue 3 years ago • 2 comments

I've been trying this autoregressive model on sequences of vector-quantized tokens. So far, training has been troublesome. Any suggestions? Thanks in advance.

MichelPezzat, Apr 06 '21 21:04

Weird model! If you’re doing the usual VQ-VAE thing with the straight-through estimator, could there be some kind of accumulation of error? I can’t picture the model exactly, but you’ll probably have problems with any VQ-VAE if you have an autoregressive process in training that can “compound” the error from the ST estimator.
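For readers unfamiliar with the term, here is a minimal sketch of the straight-through estimator, written by hand in plain Python instead of with an autograd library. The scalar codebook, the target value, and the function names are all hypothetical and chosen only for illustration; this is not the code from the repository.

```python
def quantize(z_e, codebook):
    """Return the codebook entry nearest to the scalar encoder output z_e."""
    return min(codebook, key=lambda c: abs(c - z_e))


def loss_and_ste_grad(z_e, codebook, target):
    """Squared-error loss on the quantized value, with a straight-through gradient.

    The forward pass uses the quantized value z_q. The true derivative of z_q
    with respect to z_e is zero almost everywhere, so the straight-through
    estimator pretends dz_q/dz_e = 1 and copies the gradient from z_q to z_e.
    """
    z_q = quantize(z_e, codebook)
    loss = (z_q - target) ** 2
    grad_z_q = 2.0 * (z_q - target)  # exact gradient w.r.t. the quantized value
    grad_z_e = grad_z_q              # straight-through: passed to z_e unchanged
    return loss, grad_z_e


codebook = [-1.0, 0.0, 1.0]          # hypothetical 3-entry scalar codebook
loss, grad = loss_and_ste_grad(z_e=0.4, codebook=codebook, target=1.0)
```

The gradient handed to the encoder ignores the rounding from 0.4 to the nearest code, so it is biased. That bias is the error that could, in principle, compound when an autoregressive process is trained through the quantizer.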

zackziegler95, Apr 30 '21 02:04

I'm not familiar with the STE concept. The discrete flow models the prior over the discrete bottleneck sequence produced by the previously trained encoder. So far, the KL term goes too high (to around 2 billion), so training goes nowhere. I guess I'll try something else. Thanks for replying anyway.
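One sanity check that may help put a number like that in context: a prior that assigns uniform probability to each of K codebook entries costs exactly log K nats per token, so a trained prior should do no worse than that. The sketch below computes this baseline in plain Python. The codebook size and sequence length are hypothetical placeholders, not values from this thread, and the helper names are made up for the example.

```python
import math


def uniform_prior_nll(num_codes, seq_len):
    """Negative log-likelihood (in nats) of a sequence under a uniform prior."""
    return seq_len * math.log(num_codes)


def categorical_nll(probs, tokens):
    """NLL in nats of integer tokens under per-position categorical distributions."""
    return -sum(math.log(p[t]) for p, t in zip(probs, tokens))


num_codes, seq_len = 512, 1024  # hypothetical codebook size and sequence length
baseline = uniform_prior_nll(num_codes, seq_len)

# A uniform model evaluated token by token reproduces the baseline.
tokens = [i % num_codes for i in range(seq_len)]
uniform = [[1.0 / num_codes] * num_codes for _ in range(seq_len)]
assert math.isclose(categorical_nll(uniform, tokens), baseline)
```

With these placeholder sizes the baseline is roughly 6,400 nats for the whole sequence. If the real sizes are of a similar order, a value in the billions would suggest something other than a weak prior, for example a term summed over far more elements than intended or a numerically unstable log-probability.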

MichelPezzat, Apr 30 '21 04:04