taming-transformers icon indicating copy to clipboard operation
taming-transformers copied to clipboard

Generating higher resolution images for FacesHQ data

Open janismdhanbad opened this issue 3 years ago • 1 comments

Hello authors,

Thank you for the amazing work! I am trying to generate a face image of higher resolution (512x512). My strategy is to initiate the z_q as a random vector of integers between (0, 1023) of dimension 1x1024, reshape this to 32x32 and then use model.decode_to_img to make an image of 512x512. To make a sensible image of faces I autoregressively generate the next codebook token using row-major sequence on the 32x32 matrix, similar mechanism as given in the notebook here . Unfortunately, the final image I get is something like this

download

It is something like the repetition of a pattern of faces. Could you please guide me on this?

Thanks!

janismdhanbad avatar May 25 '21 21:05 janismdhanbad

I am facing the same issue. Any luck?

dysdsyd avatar Jun 04 '21 19:06 dysdsyd