PIDM icon indicating copy to clipboard operation
PIDM copied to clipboard

Person Image Synthesis via Denoising Diffusion Model (CVPR 2023)

Results 31 PIDM issues
Sort by recently updated
recently updated
newest added

I've upgraded the code to be compatible with PyTorch2.0 and also replaced the attention/crossattention node to use PyTorch's build-in Multihead Attention which also of course supports flash attention out of...

how to generate the reference_pose_0.npy from image ?

Thanks for sharing this great work. In the paper, you mentioned that "transfer rich multi-scale texture patterns from the source image distribution to the noise prediction" How ever, in the...

Hi, Current pipeline seems to start from complete noise, is it possible to have a sample code snippet where the generation starts from latents generated from another image like in...

Have you ever conducted a CFG-deactivating ablation experiment? I'm curious as to whether deactivating CFG will significantly affect the results.

![453A8A48-8D85-4826-9FDC-8945D88313FF](https://user-images.githubusercontent.com/26623882/236680341-92e667d7-e47f-4e9a-b25a-3847ffe59a1c.png) Thanks for your great work. did you do any ablation study of the TDB reference feature size, for example with 64x64, 32x32, 16x16, 8x8?

Incredible work! However, the code of the model structure is quite hard to read for me. Is there any chance to post a model structure figure or anything that helps...

frozen_out = th.cat([model_output.detach(), model_var_values], dim=1) terms["vb"] = self._vb_terms_bpd( model=lambda *args, r=frozen_out: r, x_start=x_start, x_t=x_t, t=t, clip_denoised=False, )["output"]