Grounded-Diffusion issues

How long does it take to retrain this model?

How long does it take to retrain this model? 是否有并行训练这个模型的方法？一张卡上训练的话好像需要很久很久；

how to evaluate the checkpoint after train?

I follow the readme use python train.py --class_split 1 --train_data random --save_name pascal_1_random ' to train the model and generate the checkpoints；now how to evaluate them? I dont find the...

yinyjin

teaser image

Hello, Thank you for your excellent work.I am very interested in the teaser input images, it is the following pictures, ![image](https://github.com/Lipurple/Grounded-Diffusion/assets/87483029/35fad92d-ea76-41f2-908f-1205e9ce8707) Can you post them?

develop-productivity

The confused definition of open-vocabulary segmentation

Thanks for your excellent work! I am confused about the definition of open-vocabulary segmentation from two aspects: 1. I note that the segmentation model (i.e., maskformer in the paper) is...

kinredon

RuntimeError: PytorchStreamReader failed reading zip archive: failed finding central directory

2

when i run Inference, python test.py --sd_ckpt 'lipurple/stable_diffusion.ckpt' --grounding_ckpt 'lipurple/grounding_module.pth' --prompt "a photo of a lion on a mountain top at sunset" --category "lion" and will occur ：RuntimeError: PytorchStreamReader failed...

cvJie

Release of COCO training script

Thanks for the great work! At the moment, the provided train.py seems to be hardwired to train on the Pascal VOC dataset. Is there a plan to release the COCO...

RogerQi

Inference Speed

1

Thanks a lot for your great work! May I know what is the inference speed for generating grounded images?

vidit98

Grounded-Diffusion
Grounded-Diffusion copied to clipboard

Metadata

How long does it take to retrain this model?

how to evaluate the checkpoint after train?

teaser image

The confused definition of open-vocabulary segmentation

RuntimeError: PytorchStreamReader failed reading zip archive: failed finding central directory

Release of COCO training script

Inference Speed

← Metadata

Owner

Metadata

Grounded-Diffusion Grounded-Diffusion copied to clipboard

Metadata

How long does it take to retrain this model?

how to evaluate the checkpoint after train?

teaser image

The confused definition of open-vocabulary segmentation

RuntimeError: PytorchStreamReader failed reading zip archive: failed finding central directory

Release of COCO training script

Inference Speed

← Metadata

Owner

Metadata

Grounded-Diffusion
Grounded-Diffusion copied to clipboard