Quynh Phung

Results 1 issues of Quynh Phung

It is not an issue. I am curious why cross-attention maps of stage c are quite noisy, not represent objects mask like Stable Diffusion? any suggestion for this problem? Thanks