VQ-Font
About the training of the VQ-VAE.
In `VQ-Font/model/VQ-VAE.ipynb`:

```python
for i in range(num_training_updates):
    data = next(iter(train_loader))
    train_data_variance = torch.var(data)
    # print(train_data_variance)
    # show(make_grid(data.cpu().data))
    # break
    data = data - 0.5  # normalize to [-0.5, 0.5]
    data = data.to(device)
    optimizer.zero_grad()
```
This code normalizes the data to [-0.5, 0.5]. However, the last layer of the VQ-VAE decoder is a sigmoid, whose output lies in (0, 1). Is this a mistake?
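A minimal sketch of the suspected mismatch (the pixel values and helper below are illustrative, not the repo's actual tensors): a sigmoid output layer is strictly inside (0, 1), so it can never reach the negative half of a [-0.5, 0.5] target range.

```python
import math

def sigmoid(x: float) -> float:
    return 1.0 / (1.0 + math.exp(-x))

# Targets normalized to [-0.5, 0.5], as in the quoted training loop.
pixels = [i / 7 for i in range(8)]   # raw pixels in [0, 1]
targets = [p - 0.5 for p in pixels]  # shifted to [-0.5, 0.5]

# Sigmoid outputs stay strictly inside (0, 1) for any pre-activation.
outputs = [sigmoid(x) for x in (-5.0, 0.0, 5.0)]
print(min(outputs) > 0.0)  # True: a sigmoid decoder never emits negatives
print(min(targets))        # -0.5: half the target range is unreachable
```

If the reconstruction loss compares sigmoid outputs directly against the shifted data, the negative half of the targets can only ever be approximated from above.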
And another question: in the VQ-VAE pretraining the data are normalized to [-0.5, 0.5], but in training phase 2 the content image (which is fed to the content encoder) is normalized to [-1, 1].
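To make the discrepancy concrete, here is a sketch of the two normalization conventions (the function names are ours, not the repo's). The two ranges differ by a factor of 2, so the same raw pixel maps to different encoder inputs unless one convention is converted to the other:

```python
def normalize_vqvae(x: float) -> float:
    # VQ-VAE pretraining: [0, 1] -> [-0.5, 0.5]
    return x - 0.5

def normalize_phase2(x: float) -> float:
    # phase-2 content encoder: [0, 1] -> [-1, 1]
    return x * 2.0 - 1.0

x = 0.25  # an example raw pixel value in [0, 1]
print(normalize_vqvae(x))   # -0.25
print(normalize_phase2(x))  # -0.5

# The two conventions are related by a constant scale of 2:
assert normalize_phase2(x) == 2.0 * normalize_vqvae(x)
```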
Sorry for the late reply. After all this time I can no longer remember the reason for `data = data - 0.5 # normalize to [-0.5, 0.5]`; it may or may not be a mistake.