Grounded-Diffusion icon indicating copy to clipboard operation
Grounded-Diffusion copied to clipboard

Open-vocabulary Object Segmentation with Diffusion Models

Results 7 Grounded-Diffusion issues
Sort by recently updated
recently updated
newest added

How long does it take to retrain this model? 是否有并行训练这个模型的方法?一张卡上训练的话好像需要很久很久;

I follow the readme use python train.py --class_split 1 --train_data random --save_name pascal_1_random ' to train the model and generate the checkpoints;now how to evaluate them? I dont find the...

Hello, Thank you for your excellent work.I am very interested in the teaser input images, it is the following pictures, ![image](https://github.com/Lipurple/Grounded-Diffusion/assets/87483029/35fad92d-ea76-41f2-908f-1205e9ce8707) Can you post them?

Thanks for your excellent work! I am confused about the definition of open-vocabulary segmentation from two aspects: 1. I note that the segmentation model (i.e., maskformer in the paper) is...

when i run Inference, python test.py --sd_ckpt 'lipurple/stable_diffusion.ckpt' --grounding_ckpt 'lipurple/grounding_module.pth' --prompt "a photo of a lion on a mountain top at sunset" --category "lion" and will occur :RuntimeError: PytorchStreamReader failed...

Thanks for the great work! At the moment, the provided train.py seems to be hardwired to train on the Pascal VOC dataset. Is there a plan to release the COCO...

Thanks a lot for your great work! May I know what is the inference speed for generating grounded images?