Feature extraction for THUMOS14 is strange

Open makecent opened this issue 4 years ago • 4 comments

According to the "Implementation Details" part of Section 4 of the original paper, you use a model pre-trained on the ActivityNet-1.3 training set as the feature extractor. If I am not mistaken, your BMN uses the output of the last layer as the feature, which is why the features are 400-dimensional (200 classes × 2 streams).
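
To make the dimension arithmetic concrete, here is a minimal sketch of how a 400-dimensional snippet feature would arise from concatenating 200 class scores from each of two streams. The stream names `rgb` and `flow`, the helper `snippet_feature`, and the placeholder scores are my own assumptions for illustration, not code from this repository.

```python
NUM_CLASSES = 200          # ActivityNet-1.3 action classes
STREAMS = ("rgb", "flow")  # assumed names for the two streams


def snippet_feature(scores_by_stream):
    """Concatenate per-stream class scores into one snippet feature."""
    feature = []
    for stream in STREAMS:
        scores = scores_by_stream[stream]
        if len(scores) != NUM_CLASSES:
            raise ValueError(
                f"{stream}: expected {NUM_CLASSES} scores, got {len(scores)}"
            )
        feature.extend(scores)
    return feature


# Placeholder uniform scores; a real extractor would output class scores here.
dummy = {s: [1.0 / NUM_CLASSES] * NUM_CLASSES for s in STREAMS}
feature = snippet_feature(dummy)
print(len(feature))  # 400
```

Every entry of such a feature is a score for one ActivityNet class in one stream, which is what the question below is about.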

But the output of the last layer represents the predicted class scores of the input frame over the 200 ActivityNet actions. Such features should be meaningless for THUMOS14, because half of the action classes in THUMOS14 do not belong to ActivityNet. How can you detect an unseen type of action using the classification scores of 200 unrelated actions?

makecent • Oct 20 '20 16:10

I have the same problem.

chenshen03 • Aug 29 '21 04:08

Maybe he means the features are extracted after further training that starts from the ActivityNet-1.3 pretrained model? That would make sense.

xianguo-dev • Jun 30 '22 12:06

> Maybe he means the features are extracted after further training that starts from the ActivityNet-1.3 pretrained model? That would make sense.

No, the feature extractor (the pretrained model) is kept fixed, without any further training.
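
To spell out what "fixed" means here, a toy sketch: the extractor's weights are never touched, features are computed once, and only a downstream parameter is trained on top of them. Everything in it (`EXTRACTOR_WEIGHTS`, `extract`, `train_head`, the toy data) is hypothetical and only illustrates the setup, it is not the actual pipeline.

```python
# Hypothetical frozen extractor: fixed weights that are never updated.
EXTRACTOR_WEIGHTS = (0.5, -1.0, 2.0)


def extract(frame):
    """Map a toy 3-value 'frame' to one feature using the fixed weights."""
    return sum(w * x for w, x in zip(EXTRACTOR_WEIGHTS, frame))


def train_head(features, targets, lr=0.01, epochs=200):
    """Fit target ~ a * feature by gradient descent; only `a` is updated."""
    a = 0.0
    for _ in range(epochs):
        grad = sum(
            2 * (a * f - t) * f for f, t in zip(features, targets)
        ) / len(features)
        a -= lr * grad
    return a


frames = [(1.0, 0.0, 0.0), (0.0, 1.0, 0.0), (0.0, 0.0, 1.0)]
targets = [1.0, -2.0, 4.0]  # toy labels, chosen as 2 * feature

features = [extract(f) for f in frames]  # computed once, extractor untouched
head = train_head(features, targets)     # only the downstream weight learns
```

In this setup the downstream model can only work with whatever information the frozen extractor already puts into its outputs.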

makecent • Jun 30 '22 13:06