VQGAN-pytorch
How to distinguish the sos token (default = 0) from quantized image token 0?
Thanks for your video. The transformer takes in the quantized image tokens generated by VQGAN, whose codebook indices range from 0 to n_embed-1, and the transformer's sos token also defaults to 0. Could you tell me why we don't distinguish codebook vector 0 from the sos token when training the transformer?
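To make the question concrete, here is a minimal sketch of the overlap I mean. The values and the helper `prepend_sos` are hypothetical and only for illustration; they are not taken from the repository. The last part shows one possible way to keep the ids apart (reserving an id outside the codebook range), which is my assumption and not necessarily what the code should do.

```python
# Hypothetical values for illustration; not taken from the repository.
n_embed = 1024   # codebook size, so image tokens lie in [0, n_embed - 1]
sos_token = 0    # default start-of-sequence id mentioned above


def prepend_sos(indices, sos):
    """Prepend the start token to a flattened sequence of codebook indices."""
    return [sos] + list(indices)


# A quantized image in which some patches map to codebook entry 0.
image_tokens = [0, 17, 0, 512]
sequence = prepend_sos(image_tokens, sos_token)

# Positions whose id equals sos_token: position 0 is the real sos token,
# the others are ordinary image tokens that share the same id.
collisions = [i for i, t in enumerate(sequence) if t == sos_token]

# One possible alternative (an assumption, not the repository's behaviour):
# reserve an extra id outside the codebook range. The transformer would then
# need a vocabulary of n_embed + 1 entries.
distinct_sos = n_embed
distinct_sequence = prepend_sos(image_tokens, distinct_sos)
distinct_collisions = [i for i, t in enumerate(distinct_sequence) if t == distinct_sos]

print(sequence, collisions)
print(distinct_sequence, distinct_collisions)
```

With the default setting, the id 0 appears both as the sos token and as an image token, while with the reserved id only the first position matches.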