NestedFormer
Understanding TokenLearner
I am trying to understand the TokenLearner, and I believe the current implementation differs from the original TokenLearner, right? As I understand it, the original TokenLearner performs token learning separately for each temporal dimension and then reshapes back.
In this work, the TokenLearner only controls the sequence length. It doesn't need to reshape back because the network has a segmentation decoder.
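For intuition, here is a minimal NumPy sketch of the sequence-length reduction being discussed: each of `k` output tokens is an attention-weighted sum over all `N` input tokens, so the sequence length changes from `N` to `k` with no reshape back. The weight matrix `w` and the function name are illustrative assumptions, not NestedFormer's actual code.

```python
import numpy as np

def softmax(x, axis=-1):
    # Numerically stable softmax
    e = np.exp(x - x.max(axis=axis, keepdims=True))
    return e / e.sum(axis=axis, keepdims=True)

def token_learner(x, w):
    """Reduce N input tokens to k learned tokens.

    x: (N, C) token features
    w: (C, k) hypothetical learned projection producing one score map per slot
    Returns (k, C): each output token is an attention-weighted sum of inputs.
    """
    logits = x @ w                     # (N, k) per-token score for each output slot
    attn = softmax(logits.T, axis=-1)  # (k, N) attention over the input sequence
    return attn @ x                    # (k, C) reduced token set

rng = np.random.default_rng(0)
N, C, K = 64, 32, 8
x = rng.standard_normal((N, C))
w = rng.standard_normal((C, K))
z = token_learner(x, w)
print(z.shape)  # (8, 32)
```

Since the downstream decoder consumes the reduced `(k, C)` sequence directly, there is no inverse step restoring the original `N` tokens.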