TokenMix icon indicating copy to clipboard operation
TokenMix copied to clipboard

Have you considered TokenMix in hidden layers?

Open cmsflash opened this issue 3 years ago • 5 comments

In addition to TokenMix before the first Transformer block, have you considered or tried TokenMix in the middle of the model?

cmsflash avatar Sep 10 '22 01:09 cmsflash

I did try it. If I understand correctly, it is close to Manifold Mixup. I believe this will be an interesting extension of CutMix/TokenMix (etc) in the feature space. Have you tried CutMix in the feature space?

jihaonew avatar Sep 21 '22 15:09 jihaonew

Yep, I am indeed effectively talking about Manfold Mixup for TokenMix/CutMix. The Manifold Mixup paper is very interesting and thank you so much for bringing it up. I haven't tried the hidden layer version of CutMix/TokenMix. If you have tried hidden-layer TokenMix, could you share the results?

cmsflash avatar Sep 21 '22 20:09 cmsflash

OK.

jihaonew avatar Sep 22 '22 01:09 jihaonew

Great! Do you want to share the results on this issue or somewhere else?

cmsflash avatar Sep 22 '22 05:09 cmsflash

@jihaonew Did you get any results?

cmsflash avatar Oct 16 '24 00:10 cmsflash