few-shot-action-recognition icon indicating copy to clipboard operation
few-shot-action-recognition copied to clipboard

A considerable gap from the SOTA

Open Brave-peng opened this issue 4 years ago • 3 comments
trafficstars

I am trying to reimplement the OTAM and using the default arguments, this is options and the result in ssv2。

Options: Namespace( dataset='data/ssv2_256x256q5', learning_rate=0.001, tasks_per_batch=16, checkpoint_dir='checkpoint_dir', test_model_name='checkpoint_best_val.pt', training_iterations=60000, resume_from_checkpoint=False, way=5, shot=5, query_per_class=5, query_per_class_test=1, val_iters=[], num_val_tasks=1000, num_test_tasks=10000, print_freq=100, seq_len=8, num_workers=8, backbone='resnet50', opt='sgd', save_freq=5000, img_size=224, num_gpus=4, sch=[1000000], method='otam', pretrained_backbone=None, val_on_test=False, trans_linear_in_dim=2048)

test accuracies: data/ssv2_256x256q5: 44.5+/-0.4

This is a considerable gap from the SOTA. Instead, I also testing in the kinetics and using the TRX, only get the 63.2% accuracy(5-way-1-shot). Could you tell me what might be causing this?

Brave-peng avatar Nov 15 '21 13:11 Brave-peng

It's been a while since I've used this, but I seem to remember that OTAM works best with tasks_per_batch=1 (they don't mention gradient accumulation in their paper). Others who've used this code to replicate the OTAM results used that and it seemed to work OK. Hope that helps.

tobyperrett avatar Nov 15 '21 13:11 tobyperrett

In fact, that's what the setting is in scripts/examples.md. So just try running that command.

tobyperrett avatar Nov 15 '21 13:11 tobyperrett

Thank you very much for your quick reply, which is very helpful to me

Brave-peng avatar Nov 16 '21 02:11 Brave-peng