InstructDiffusion icon indicating copy to clipboard operation
InstructDiffusion copied to clipboard

PyTorch implementation of InstructDiffusion, a unifying and generic framework for aligning computer vision tasks with human instructions.

Results 17 InstructDiffusion issues
Sort by recently updated
recently updated
newest added

Thanks for your great work. I notice that you use COCO, CrowdPose, MPII, and AIC datasets for keypoint estimation. Have you ablated the influence of the diversity of the dataset...

Hi, I see that in ./stable_diffusion/ldm/models/diffusion/ddpm_edit.py, params uncond=0.05 in `def get_input( )` is not used, replaced with fixed 0.075: ![image](https://github.com/cientgu/InstructDiffusion/assets/56153611/b3cd1227-67c5-4f62-ac53-43d9a654dcdb) Also, bug: @torch.no_grad() has been commented out.

Thanks for your great work! I am curious about the the result of classification and detection in your paper: ![Screenshot 2023-09-10 174312](https://github.com/cientgu/InstructDiffusion/assets/93043187/0dbe12fc-9f35-46c8-bdd3-6cffd6d75512) You mention that you design special prompts for...

The repo contains a broken link to "Web demo", leading to page with the text "No interface is running right now".

Thanks for your great work! I notice that your [config file](https://github.com/cientgu/InstructDiffusion/blob/main/configs/instruct_diffusion.yaml) load the checkpoint "v1-5-pruned-emaonly-adaption.ckpt", which seems correspond to the "Pretraining adaptation" phase in your main paper. Meanwhile, it seems...

Hi! Thanks for your impressive work! I am trying to run the task-specific training and need to load the checkpoint generated in the [adaptation phase](https://github.com/cientgu/InstructDiffusion/blob/main/configs/instruct_diffusion.yaml#L11). Could you please provide this...

Hi, very interesting work! As mentioned in the paper, a lightweight U-Net is leveraged to post-processes the output image to generate some heatmaps (for keypoint detection) or masks (for segmentation)....

![image](https://github.com/cientgu/InstructDiffusion/assets/51200306/19fd4c22-4ee7-409c-88b0-885675bd7f7b) Equation 1 here seems to be the instruction condition ci? Could the author confirm this?