Differences between models

Open syy-create opened this issue 11 months ago • 1 comments

Hello, I noticed that there are two options for arch in Step3_WSI_classification_ACMIL.py, ga and mha. But the reference uses ga, so I want to ask what the difference is between the two? Isn't the mha framework mentioned in the paper? Thank you.

May 14 '25 14:05 syy-create

GA use gate which comes from ABMIL , MHA comes from this paper 3.2 ,this is my understanding

May 16 '25 01:05 Xuxinan001

GA use gate which comes from ABMIL , MHA comes from this paper 3.2 ,this is my understanding But the reference uses ga, so I feel a little confused.I wonder if it's just a different way of getting attention.

May 16 '25 02:05 syy-create

Gated Attention (GA) and Multi-Head Attention (MHA) represent two distinct attention formulations. Although MHA is not discussed in our main manuscript, we validated our ACMIL approach using MHA as well, with results available in Table 4 of the supplementary materials.

May 16 '25 03:05 dazhangyu123