sam
sam copied to clipboard
Will Layernorm or Groupnorm cause problems?
As mentioned in Readme, the suggested usage can potentially cause problems if you use batch normalization. Will Layernorm or Groupnorm cause problems in principle? I use SAM in Swintransformer and ATSS, No effect achieved. I want to find the reason.