taming-transformers
taming-transformers copied to clipboard
Generating higher resolution images for FacesHQ data
Hello authors,
Thank you for the amazing work! I am trying to generate a face image of higher resolution (512x512). My strategy is to initiate the z_q
as a random vector of integers between (0, 1023) of dimension 1x1024, reshape this to 32x32 and then use model.decode_to_img
to make an image of 512x512. To make a sensible image of faces I autoregressively generate the next codebook token using row-major sequence on the 32x32 matrix, similar mechanism as given in the notebook here . Unfortunately, the final image I get is something like this
It is something like the repetition of a pattern of faces. Could you please guide me on this?
Thanks!
I am facing the same issue. Any luck?