
Replicating the bits / dim results in the original VQ-VAE paper

Open pclucas14 opened this issue 5 years ago • 7 comments

Hi,

thanks for your clean implementation! I was wondering, have you ever tried to calculate the bits/dim metric (as in the original paper)? I've tried to do so using the provided code, and I'm still quite far from the results in the paper. I was hoping you might have some insight into why that is the case.

Thanks! Lucas

pclucas14, Jun 11 '19 17:06

@pclucas14 I haven't tried that yet, but I plan to. Give me a few days to figure it out. I am trying to compile a table of bits/dim for MNIST/Fashion-MNIST, so I might as well start with this.

kashif, Jun 15 '19 12:06

great! let me know what you end up getting :)

pclucas14, Jun 15 '19 16:06

Thanks! Here is where I am collecting the results: zalandoresearch/fashion-mnist#144

kashif, Jun 15 '19 16:06

great! it's a very complete list, pretty cool :D

pclucas14, Jun 17 '19 15:06

The best I got for VQ-VAE on CIFAR-10 was 5.00 BPD (using a unigram prior). This is still far from the results in the VQ-VAE paper. Hopefully you can manage to close the gap even more :)

pclucas14, Jun 18 '19 17:06

Hi @kashif, could you provide the code you use to compute the bpd?

Thanks!

pvitoria, Sep 16 '20 09:09

Wonder if there is code for proper bpd computation? If so, can you please share?

podlipensky, Apr 23 '23 21:04
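
No bpd code was posted in this thread, so here is only a minimal sketch of how a bits/dim figure with a unigram prior (the setup mentioned above for the 5.00 BPD result) could be computed. It is not code from this repository or from any commenter. The function names `unigram_log_probs` and `bits_per_dim` and the smoothing parameter `alpha` are made up for illustration. It assumes the encoder is deterministic, so the bound per image is the reconstruction NLL plus the prior NLL of the code indices, and it assumes the reconstruction NLL comes from a proper discrete likelihood over pixel values (for example a 256-way categorical per sub-pixel), not from an MSE loss.

```python
import math
from collections import Counter


def unigram_log_probs(train_codes, num_embeddings, alpha=1.0):
    """Smoothed unigram log-probabilities (in nats) over codebook indices.

    train_codes: flat list of codebook indices collected from the training set.
    alpha: additive smoothing so unseen codes do not get probability zero
           (hypothetical choice, not taken from the thread).
    """
    counts = Counter(train_codes)
    total = len(train_codes) + alpha * num_embeddings
    return [math.log((counts.get(k, 0) + alpha) / total)
            for k in range(num_embeddings)]


def bits_per_dim(recon_nll_nats, codes, log_probs, num_dims):
    """Upper bound on bits/dim for a single image.

    recon_nll_nats: -log p(x | z) in nats, summed over all data dimensions.
    codes: flat list of codebook indices for this image's latent grid.
    log_probs: output of unigram_log_probs.
    num_dims: number of data dimensions, e.g. 3 * 32 * 32 for CIFAR-10.
    """
    # With a deterministic encoder the KL term reduces to -log p(z).
    prior_nll_nats = -sum(log_probs[k] for k in codes)
    # Convert nats to bits and normalise by the number of dimensions.
    return (recon_nll_nats + prior_nll_nats) / (num_dims * math.log(2))
```

As a sanity check on the prior term alone: an 8x8 latent grid with 512 equally likely codes costs 64 * 9 = 576 bits, which is 576 / 3072 = 0.1875 bits/dim on CIFAR-10, so most of a figure like 5.00 BPD would have to come from the reconstruction term.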