daam This is not an issue but a request. Can we run this on any image with generated captions?

This is not an issue but a request. Can we run this on any image with generated captions?

Open runner22k opened this issue 2 years ago • 3 comments

This will help us in creating better descriptions of an image while training Textual inversion (embeddings) and LoRA. How to use this on any image?

does it work other-way around too? Can we feed an image to the DAAM and get text prompt with heat maps?

Feb 10 '23 22:02 runner22k

I have the same question, does the model accept any non generated image and a given caption? I would like to use this model for zero shot object localization.

Mar 27 '23 09:03 BingliangLi

In theory yes, I'll look into implementing this further when I have time.

Jun 12 '23 18:06 daemon

I managed to achieve this using this plugin: link. Use img2img and set step to 1, denoising strength to 0, and you are all set!

Jun 13 '23 00:06 BingliangLi

daam daam copied to clipboard

This is not an issue but a request. Can we run this on any image with generated captions?

daam
daam copied to clipboard