Gong Chao
Hi, such excellent work! I am reading your code and found that you don't use shuffle when training [here](https://github.com/TencentARC/BrushNet/blob/101dc3d78ffb9e6acc5d89a656b32d635babc65f/examples/brushnet/train_brushnet.py#L1125). I remember that not shuffling can hurt convergence,...
Hi, thanks for your amazing work! I am confused by the subtraction operation `image_features minus text_features`. The image features are encoded by CLIPVisionModelWithProjection, but the text features are encoded by...
Hi, thanks for your excellent work! In Eq. (7), ๐_t′ = Concat(๐ โ ๐_๐′, ๐_t). In Eq. (8), x = CA(x, ๐_t′). Thus x should be CA(x, Concat(๐ โ...