A-watermark-for-Diffusion-Models icon indicating copy to clipboard operation
A-watermark-for-Diffusion-Models copied to clipboard

Does it handle image editing as robustly as VINE?

Open Wei-Lin-Li opened this issue 10 months ago • 7 comments

Thank you for your work on this interesting project!

I recently came across a watermarking model VINE, which emphasizes robustness to image editing (such as image inversion, text-driven editing, image-to-video generation, etc.).

I’m curious to understand if your method is robust against such kinds of editing.

If you’ve conducted any evaluations or comparisons under such conditions, I would really appreciate any pointers or insights.

Wei-Lin-Li avatar May 30 '25 01:05 Wei-Lin-Li

I just generated some watermarked images by FLUX and edited by ledits and detected the bit accuracy, sadly, it just around 50%. I'm afraid it's not robust against these regenerated-based attacks.

lthero-big avatar May 30 '25 09:05 lthero-big

Oh, I see, that is interesting! Thanks for your experiments!

Wei-Lin-Li avatar May 30 '25 09:05 Wei-Lin-Li

There are additional details that I would like to share. After applying methods such as ledits to images generated by FLUX(flux1-dev), the watermark is undetectable( Bit Accuracy is around 50%). However, images generated by SDv2-1 maintain a high Bit Accuracy of approximately 99% after editing.

lthero-big avatar May 30 '25 11:05 lthero-big

That's an interesting observation! Do you have any insights into why this might be happening?

Wei-Lin-Li avatar May 30 '25 11:05 Wei-Lin-Li

Does this mean that this watermarking method is tailored for SD, and needs some improvement if wanna apply it to FLUX? Is there any way to combine VINE's robustness with this watermark method on FLUX?

Wei-Lin-Li avatar May 30 '25 11:05 Wei-Lin-Li

ledits may be optimized for SD, allowing for better preservation of the latent space structure of SDv2-1 during the editing process, thereby come with high watermark Bit Accuracy. In contrast, editing with images generated by FLUX may significantly alter the latent space, reducing the watermark detection rate.

lthero-big avatar May 30 '25 12:05 lthero-big

This makes sense. Thanks for your patient explanation! I will further explore it.

Wei-Lin-Li avatar May 30 '25 13:05 Wei-Lin-Li