CFLD icon indicating copy to clipboard operation
CFLD copied to clipboard

[CVPR 2024 Highlight] Coarse-to-Fine Latent Diffusion for Pose-Guided Person Image Synthesis

Results 5 CFLD issues
Sort by recently updated
recently updated
newest added

In your paper, Perception-Refined Decoder uses source image encoder. So, I thought appearance encoder should be used, but in your code you use 'down_block_additional_residuals' which uses pose encoder. Why is...

Is pose map(keypoint coordinate) must needed? I want to inference with my own dataset for pose. def build_pose_img(annotation_file, img_path): string = annotation_file.loc[os.path.basename(img_path)] array = load_pose_cords_from_strings(string['keypoints_y'], string['keypoints_x']) pose_map = torch.tensor(cords_to_map(array, (256,...

![image](https://github.com/user-attachments/assets/524acbcf-10f0-4468-889d-f4af00e77175) Hello authors, thanks for the great work. I want to know how you obtained the results in Table 2, which seem to differ from those provided in the original...

100%|███████████████████████████████████████████████████████████████████████████████████████████| 233M/233M [00:16