Thanks, Alex! It works for the moment pretrained model now, but I get the following errors with the jester pretrained model.
Could you take a look one more time?
Traceback (most recent call last):
  File "test_video.py", line 111, in <module>
    net.load_state_dict(base_dict)
  File "/home/vbalab/anaconda3/lib/python3.6/site-packages/torch/nn/modules/module.py", line 721, in load_state_dict
    self.__class__.__name__, "\n\t".join(error_msgs)))
RuntimeError: Error(s) in loading state_dict for TSN:
Missing key(s) in state_dict: "base_model.conv_Conv2D.weight", "base_model.conv_Conv2D.bias", "base_model.conv_batchnorm.weight", "base_model.conv_batchnorm.bias", "base_model.conv_batchnorm.running_mean", "base_model.conv_batchnorm.running_var", "base_model.conv_1_Conv2D.weight", "base_model.conv_1_Conv2D.bias", "base_model.conv_1_batchnorm.weight", "base_model.conv_1_batchnorm.bias", "base_model.conv_1_batchnorm.running_mean", "base_model.conv_1_batchnorm.running_var", "base_model.conv_2_Conv2D.weight", "base_model.conv_2_Conv2D.bias", "base_model.conv_2_batchnorm.weight", "base_model.conv_2_batchnorm.bias", "base_model.conv_2_batchnorm.running_mean", "base_model.conv_2_batchnorm.running_var", "base_model.conv_3_Conv2D.weight", "base_model.conv_3_Conv2D.bias", "base_model.conv_3_batchnorm.weight", "base_model.conv_3_batchnorm.bias", "base_model.conv_3_batchnorm.running_mean", "base_model.conv_3_batchnorm.running_var", "base_model.conv_4_Conv2D.weight", "base_model.conv_4_Conv2D.bias", "base_model.conv_4_batchnorm.weight", "base_model.conv_4_batchnorm.bias", "base_model.conv_4_batchnorm.running_mean", "base_model.conv_4_batchnorm.running_var", "base_model.mixed_conv_Conv2D.weight", "base_model.mixed_conv_Conv2D.bias", "base_model.mixed_conv_batchnorm.weight", "base_model.mixed_conv_batchnorm.bias", "base_model.mixed_conv_batchnorm.running_mean", "base_model.mixed_conv_batchnorm.running_var", "base_model.mixed_tower_conv_Conv2D.weight", "base_model.mixed_tower_conv_Conv2D.bias", "base_model.mixed_tower_conv_batchnorm.weight", "base_model.mixed_tower_conv_batchnorm.bias", "base_model.mixed_tower_conv_batchnorm.running_mean", "base_model.mixed_tower_conv_batchnorm.running_var", "base_model.mixed_tower_conv_1_Conv2D.weight", "base_model.mixed_tower_conv_1_Conv2D.bias", "base_model.mixed_tower_conv_1_batchnorm.weight", "base_model.mixed_tower_conv_1_batchnorm.bias", "base_model.mixed_tower_conv_1_batchnorm.running_mean", 
"base_model.mixed_tower_conv_1_batchnorm.running_var", "base_model.mixed_tower_1_conv_Conv2D.weight", "base_model.mixed_tower_1_conv_Conv2D.bias", "base_model.mixed_tower_1_conv_batchnorm.weight", "base_model.mixed_tower_1_conv_batchnorm.bias", "base_model.mixed_tower_1_conv_batchnorm.running_mean", "base_model.mixed_tower_1_conv_batchnorm.running_var", "base_model.mixed_tower_1_conv_1_Conv2D.weight", "base_model.mixed_tower_1_conv_1_Conv2D.bias", "base_model.mixed_tower_1_conv_1_batchnorm.weight", "base_model.mixed_tower_1_conv_1_batchnorm.bias", "base_model.mixed_tower_1_conv_1_batchnorm.running_mean", "base_model.mixed_tower_1_conv_1_batchnorm.running_var", "base_model.mixed_tower_1_conv_2_Conv2D.weight", "base_model.mixed_tower_1_conv_2_Conv2D.bias", "base_model.mixed_tower_1_conv_2_batchnorm.weight", "base_model.mixed_tower_1_conv_2_batchnorm.bias", "base_model.mixed_tower_1_conv_2_batchnorm.running_mean", "base_model.mixed_tower_1_conv_2_batchnorm.running_var", "base_model.mixed_tower_2_conv_Conv2D.weight", "base_model.mixed_tower_2_conv_Conv2D.bias", "base_model.mixed_tower_2_conv_batchnorm.weight", "base_model.mixed_tower_2_conv_batchnorm.bias", "base_model.mixed_tower_2_conv_batchnorm.running_mean", "base_model.mixed_tower_2_conv_batchnorm.running_var", "base_model.mixed_1_conv_Conv2D.weight", "base_model.mixed_1_conv_Conv2D.bias", "base_model.mixed_1_conv_batchnorm.weight", "base_model.mixed_1_conv_batchnorm.bias", "base_model.mixed_1_conv_batchnorm.running_mean", "base_model.mixed_1_conv_batchnorm.running_var", "base_model.mixed_1_tower_conv_Conv2D.weight", "base_model.mixed_1_tower_conv_Conv2D.bias", "base_model.mixed_1_tower_conv_batchnorm.weight", "base_model.mixed_1_tower_conv_batchnorm.bias", "base_model.mixed_1_tower_conv_batchnorm.running_mean", "base_model.mixed_1_tower_conv_batchnorm.running_var", "base_model.mixed_1_tower_conv_1_Conv2D.weight", "base_model.mixed_1_tower_conv_1_Conv2D.bias", "base_model.mixed_1_tower_conv_1_batchnorm.weight", 
"base_model.mixed_1_tower_conv_1_batchnorm.bias", "base_model.mixed_1_tower_conv_1_batchnorm.running_mean", "base_model.mixed_1_tower_conv_1_batchnorm.running_var", "base_model.mixed_1_tower_1_conv_Conv2D.weight", "base_model.mixed_1_tower_1_conv_Conv2D.bias", "base_model.mixed_1_tower_1_conv_batchnorm.weight", "base_model.mixed_1_tower_1_conv_batchnorm.bias", "base_model.mixed_1_tower_1_conv_batchnorm.running_mean", "base_model.mixed_1_tower_1_conv_batchnorm.running_var", "base_model.mixed_1_tower_1_conv_1_Conv2D.weight", "base_model.mixed_1_tower_1_conv_1_Conv2D.bias", "base_model.mixed_1_tower_1_conv_1_batchnorm.weight", "base_model.mixed_1_tower_1_conv_1_batchnorm.bias", "base_model.mixed_1_tower_1_conv_1_batchnorm.running_mean", "base_model.mixed_1_tower_1_conv_1_batchnorm.running_var", "base_model.mixed_1_tower_1_conv_2_Conv2D.weight", "base_model.mixed_1_tower_1_conv_2_Conv2D.bias", "base_model.mixed_1_tower_1_conv_2_batchnorm.weight", "base_model.mixed_1_tower_1_conv_2_batchnorm.bias", "base_model.mixed_1_tower_1_conv_2_batchnorm.running_mean", "base_model.mixed_1_tower_1_conv_2_batchnorm.running_var", "base_model.mixed_1_tower_2_conv_Conv2D.weight", "base_model.mixed_1_tower_2_conv_Conv2D.bias", "base_model.mixed_1_tower_2_conv_batchnorm.weight", "base_model.mixed_1_tower_2_conv_batchnorm.bias", "base_model.mixed_1_tower_2_conv_batchnorm.running_mean", "base_model.mixed_1_tower_2_conv_batchnorm.running_var", "base_model.mixed_2_conv_Conv2D.weight", "base_model.mixed_2_conv_Conv2D.bias", "base_model.mixed_2_conv_batchnorm.weight", "base_model.mixed_2_conv_batchnorm.bias", "base_model.mixed_2_conv_batchnorm.running_mean", "base_model.mixed_2_conv_batchnorm.running_var", "base_model.mixed_2_tower_conv_Conv2D.weight", "base_model.mixed_2_tower_conv_Conv2D.bias", "base_model.mixed_2_tower_conv_batchnorm.weight", "base_model.mixed_2_tower_conv_batchnorm.bias", "base_model.mixed_2_tower_conv_batchnorm.running_mean", 
"base_model.mixed_2_tower_conv_batchnorm.running_var", "base_model.mixed_2_tower_conv_1_Conv2D.weight", "base_model.mixed_2_tower_conv_1_Conv2D.bias", "base_model.mixed_2_tower_conv_1_batchnorm.weight", "base_model.mixed_2_tower_conv_1_batchnorm.bias", "base_model.mixed_2_tower_conv_1_batchnorm.running_mean", "base_model.mixed_2_tower_conv_1_batchnorm.running_var", "base_model.mixed_2_tower_1_conv_Conv2D.weight", "base_model.mixed_2_tower_1_conv_Conv2D.bias", "base_model.mixed_2_tower_1_conv_batchnorm.weight", "base_model.mixed_2_tower_1_conv_batchnorm.bias", "base_model.mixed_2_tower_1_conv_batchnorm.running_mean", "base_model.mixed_2_tower_1_conv_batchnorm.running_var", "base_model.mixed_2_tower_1_conv_1_Conv2D.weight", "base_model.mixed_2_tower_1_conv_1_Conv2D.bias", "base_model.mixed_2_tower_1_conv_1_batchnorm.weight", "base_model.mixed_2_tower_1_conv_1_batchnorm.bias", "base_model.mixed_2_tower_1_conv_1_batchnorm.running_mean", "base_model.mixed_2_tower_1_conv_1_batchnorm.running_var", "base_model.mixed_2_tower_1_conv_2_Conv2D.weight", "base_model.mixed_2_tower_1_conv_2_Conv2D.bias", "base_model.mixed_2_tower_1_conv_2_batchnorm.weight", "base_model.mixed_2_tower_1_conv_2_batchnorm.bias", "base_model.mixed_2_tower_1_conv_2_batchnorm.running_mean", "base_model.mixed_2_tower_1_conv_2_batchnorm.running_var", "base_model.mixed_2_tower_2_conv_Conv2D.weight", "base_model.mixed_2_tower_2_conv_Conv2D.bias", "base_model.mixed_2_tower_2_conv_batchnorm.weight", "base_model.mixed_2_tower_2_conv_batchnorm.bias", "base_model.mixed_2_tower_2_conv_batchnorm.running_mean", "base_model.mixed_2_tower_2_conv_batchnorm.running_var", "base_model.mixed_3_conv_Conv2D.weight", "base_model.mixed_3_conv_Conv2D.bias", "base_model.mixed_3_conv_batchnorm.weight", "base_model.mixed_3_conv_batchnorm.bias", "base_model.mixed_3_conv_batchnorm.running_mean", "base_model.mixed_3_conv_batchnorm.running_var", "base_model.mixed_3_tower_conv_Conv2D.weight", 
"base_model.mixed_3_tower_conv_Conv2D.bias", "base_model.mixed_3_tower_conv_batchnorm.weight", "base_model.mixed_3_tower_conv_batchnorm.bias", "base_model.mixed_3_tower_conv_batchnorm.running_mean", "base_model.mixed_3_tower_conv_batchnorm.running_var", "base_model.mixed_3_tower_conv_1_Conv2D.weight", "base_model.mixed_3_tower_conv_1_Conv2D.bias", "base_model.mixed_3_tower_conv_1_batchnorm.weight", "base_model.mixed_3_tower_conv_1_batchnorm.bias", "base_model.mixed_3_tower_conv_1_batchnorm.running_mean", "base_model.mixed_3_tower_conv_1_batchnorm.running_var", "base_model.mixed_3_tower_conv_2_Conv2D.weight", "base_model.mixed_3_tower_conv_2_Conv2D.bias", "base_model.mixed_3_tower_conv_2_batchnorm.weight", "base_model.mixed_3_tower_conv_2_batchnorm.bias", "base_model.mixed_3_tower_conv_2_batchnorm.running_mean", "base_model.mixed_3_tower_conv_2_batchnorm.running_var", "base_model.mixed_4_conv_Conv2D.weight", "base_model.mixed_4_conv_Conv2D.bias", "base_model.mixed_4_conv_batchnorm.weight", "base_model.mixed_4_conv_batchnorm.bias", "base_model.mixed_4_conv_batchnorm.running_mean", "base_model.mixed_4_conv_batchnorm.running_var", "base_model.mixed_4_tower_conv_Conv2D.weight", "base_model.mixed_4_tower_conv_Conv2D.bias", "base_model.mixed_4_tower_conv_batchnorm.weight", "base_model.mixed_4_tower_conv_batchnorm.bias", "base_model.mixed_4_tower_conv_batchnorm.running_mean", "base_model.mixed_4_tower_conv_batchnorm.running_var", "base_model.mixed_4_tower_conv_1_Conv2D.weight", "base_model.mixed_4_tower_conv_1_Conv2D.bias", "base_model.mixed_4_tower_conv_1_batchnorm.weight", "base_model.mixed_4_tower_conv_1_batchnorm.bias", "base_model.mixed_4_tower_conv_1_batchnorm.running_mean", "base_model.mixed_4_tower_conv_1_batchnorm.running_var", "base_model.mixed_4_tower_conv_2_Conv2D.weight", "base_model.mixed_4_tower_conv_2_Conv2D.bias", "base_model.mixed_4_tower_conv_2_batchnorm.weight", "base_model.mixed_4_tower_conv_2_batchnorm.bias", 
"base_model.mixed_4_tower_conv_2_batchnorm.running_mean", "base_model.mixed_4_tower_conv_2_batchnorm.running_var", "base_model.mixed_4_tower_1_conv_Conv2D.weight", "base_model.mixed_4_tower_1_conv_Conv2D.bias", "base_model.mixed_4_tower_1_conv_batchnorm.weight", "base_model.mixed_4_tower_1_conv_batchnorm.bias", "base_model.mixed_4_tower_1_conv_batchnorm.running_mean", "base_model.mixed_4_tower_1_conv_batchnorm.running_var", "base_model.mixed_4_tower_1_conv_1_Conv2D.weight", "base_model.mixed_4_tower_1_conv_1_Conv2D.bias", "base_model.mixed_4_tower_1_conv_1_batchnorm.weight", "base_model.mixed_4_tower_1_conv_1_batchnorm.bias", "base_model.mixed_4_tower_1_conv_1_batchnorm.running_mean", "base_model.mixed_4_tower_1_conv_1_batchnorm.running_var", "base_model.mixed_4_tower_1_conv_2_Conv2D.weight", "base_model.mixed_4_tower_1_conv_2_Conv2D.bias", "base_model.mixed_4_tower_1_conv_2_batchnorm.weight", "base_model.mixed_4_tower_1_conv_2_batchnorm.bias", "base_model.mixed_4_tower_1_conv_2_batchnorm.running_mean", "base_model.mixed_4_tower_1_conv_2_batchnorm.running_var", "base_model.mixed_4_tower_1_conv_3_Conv2D.weight", "base_model.mixed_4_tower_1_conv_3_Conv2D.bias", "base_model.mixed_4_tower_1_conv_3_batchnorm.weight", "base_model.mixed_4_tower_1_conv_3_batchnorm.bias", "base_model.mixed_4_tower_1_conv_3_batchnorm.running_mean", "base_model.mixed_4_tower_1_conv_3_batchnorm.running_var", "base_model.mixed_4_tower_1_conv_4_Conv2D.weight", "base_model.mixed_4_tower_1_conv_4_Conv2D.bias", "base_model.mixed_4_tower_1_conv_4_batchnorm.weight", "base_model.mixed_4_tower_1_conv_4_batchnorm.bias", "base_model.mixed_4_tower_1_conv_4_batchnorm.running_mean", "base_model.mixed_4_tower_1_conv_4_batchnorm.running_var", "base_model.mixed_4_tower_2_conv_Conv2D.weight", "base_model.mixed_4_tower_2_conv_Conv2D.bias", "base_model.mixed_4_tower_2_conv_batchnorm.weight", "base_model.mixed_4_tower_2_conv_batchnorm.bias", "base_model.mixed_4_tower_2_conv_batchnorm.running_mean", 
"base_model.mixed_4_tower_2_conv_batchnorm.running_var", "base_model.mixed_5_conv_Conv2D.weight", "base_model.mixed_5_conv_Conv2D.bias", "base_model.mixed_5_conv_batchnorm.weight", "base_model.mixed_5_conv_batchnorm.bias", "base_model.mixed_5_conv_batchnorm.running_mean", "base_model.mixed_5_conv_batchnorm.running_var", "base_model.mixed_5_tower_conv_Conv2D.weight", "base_model.mixed_5_tower_conv_Conv2D.bias", "base_model.mixed_5_tower_conv_batchnorm.weight", "base_model.mixed_5_tower_conv_batchnorm.bias", "base_model.mixed_5_tower_conv_batchnorm.running_mean", "base_model.mixed_5_tower_conv_batchnorm.running_var", "base_model.mixed_5_tower_conv_1_Conv2D.weight", "base_model.mixed_5_tower_conv_1_Conv2D.bias", "base_model.mixed_5_tower_conv_1_batchnorm.weight", "base_model.mixed_5_tower_conv_1_batchnorm.bias", "base_model.mixed_5_tower_conv_1_batchnorm.running_mean", "base_model.mixed_5_tower_conv_1_batchnorm.running_var", "base_model.mixed_5_tower_conv_2_Conv2D.weight", "base_model.mixed_5_tower_conv_2_Conv2D.bias", "base_model.mixed_5_tower_conv_2_batchnorm.weight", "base_model.mixed_5_tower_conv_2_batchnorm.bias", "base_model.mixed_5_tower_conv_2_batchnorm.running_mean", "base_model.mixed_5_tower_conv_2_batchnorm.running_var", "base_model.mixed_5_tower_1_conv_Conv2D.weight", "base_model.mixed_5_tower_1_conv_Conv2D.bias", "base_model.mixed_5_tower_1_conv_batchnorm.weight", "base_model.mixed_5_tower_1_conv_batchnorm.bias", "base_model.mixed_5_tower_1_conv_batchnorm.running_mean", "base_model.mixed_5_tower_1_conv_batchnorm.running_var", "base_model.mixed_5_tower_1_conv_1_Conv2D.weight", "base_model.mixed_5_tower_1_conv_1_Conv2D.bias", "base_model.mixed_5_tower_1_conv_1_batchnorm.weight", "base_model.mixed_5_tower_1_conv_1_batchnorm.bias", "base_model.mixed_5_tower_1_conv_1_batchnorm.running_mean", "base_model.mixed_5_tower_1_conv_1_batchnorm.running_var", "base_model.mixed_5_tower_1_conv_2_Conv2D.weight", "base_model.mixed_5_tower_1_conv_2_Conv2D.bias", 
"base_model.mixed_5_tower_1_conv_2_batchnorm.weight", "base_model.mixed_5_tower_1_conv_2_batchnorm.bias", "base_model.mixed_5_tower_1_conv_2_batchnorm.running_mean", "base_model.mixed_5_tower_1_conv_2_batchnorm.running_var", "base_model.mixed_5_tower_1_conv_3_Conv2D.weight", "base_model.mixed_5_tower_1_conv_3_Conv2D.bias", "base_model.mixed_5_tower_1_conv_3_batchnorm.weight", "base_model.mixed_5_tower_1_conv_3_batchnorm.bias", "base_model.mixed_5_tower_1_conv_3_batchnorm.running_mean", "base_model.mixed_5_tower_1_conv_3_batchnorm.running_var", "base_model.mixed_5_tower_1_conv_4_Conv2D.weight", "base_model.mixed_5_tower_1_conv_4_Conv2D.bias", "base_model.mixed_5_tower_1_conv_4_batchnorm.weight", "base_model.mixed_5_tower_1_conv_4_batchnorm.bias", "base_model.mixed_5_tower_1_conv_4_batchnorm.running_mean", "base_model.mixed_5_tower_1_conv_4_batchnorm.running_var", "base_model.mixed_5_tower_2_conv_Conv2D.weight", "base_model.mixed_5_tower_2_conv_Conv2D.bias", "base_model.mixed_5_tower_2_conv_batchnorm.weight", "base_model.mixed_5_tower_2_conv_batchnorm.bias", "base_model.mixed_5_tower_2_conv_batchnorm.running_mean", "base_model.mixed_5_tower_2_conv_batchnorm.running_var", "base_model.mixed_6_conv_Conv2D.weight", "base_model.mixed_6_conv_Conv2D.bias", "base_model.mixed_6_conv_batchnorm.weight", "base_model.mixed_6_conv_batchnorm.bias", "base_model.mixed_6_conv_batchnorm.running_mean", "base_model.mixed_6_conv_batchnorm.running_var", "base_model.mixed_6_tower_conv_Conv2D.weight", "base_model.mixed_6_tower_conv_Conv2D.bias", "base_model.mixed_6_tower_conv_batchnorm.weight", "base_model.mixed_6_tower_conv_batchnorm.bias", "base_model.mixed_6_tower_conv_batchnorm.running_mean", "base_model.mixed_6_tower_conv_batchnorm.running_var", "base_model.mixed_6_tower_conv_1_Conv2D.weight", "base_model.mixed_6_tower_conv_1_Conv2D.bias", "base_model.mixed_6_tower_conv_1_batchnorm.weight", "base_model.mixed_6_tower_conv_1_batchnorm.bias", 
"base_model.mixed_6_tower_conv_1_batchnorm.running_mean", "base_model.mixed_6_tower_conv_1_batchnorm.running_var", "base_model.mixed_6_tower_conv_2_Conv2D.weight", "base_model.mixed_6_tower_conv_2_Conv2D.bias", "base_model.mixed_6_tower_conv_2_batchnorm.weight", "base_model.mixed_6_tower_conv_2_batchnorm.bias", "base_model.mixed_6_tower_conv_2_batchnorm.running_mean", "base_model.mixed_6_tower_conv_2_batchnorm.running_var", "base_model.mixed_6_tower_1_conv_Conv2D.weight", "base_model.mixed_6_tower_1_conv_Conv2D.bias", "base_model.mixed_6_tower_1_conv_batchnorm.weight", "base_model.mixed_6_tower_1_conv_batchnorm.bias", "base_model.mixed_6_tower_1_conv_batchnorm.running_mean", "base_model.mixed_6_tower_1_conv_batchnorm.running_var", "base_model.mixed_6_tower_1_conv_1_Conv2D.weight", "base_model.mixed_6_tower_1_conv_1_Conv2D.bias", "base_model.mixed_6_tower_1_conv_1_batchnorm.weight", "base_model.mixed_6_tower_1_conv_1_batchnorm.bias", "base_model.mixed_6_tower_1_conv_1_batchnorm.running_mean", "base_model.mixed_6_tower_1_conv_1_batchnorm.running_var", "base_model.mixed_6_tower_1_conv_2_Conv2D.weight", "base_model.mixed_6_tower_1_conv_2_Conv2D.bias", "base_model.mixed_6_tower_1_conv_2_batchnorm.weight", "base_model.mixed_6_tower_1_conv_2_batchnorm.bias", "base_model.mixed_6_tower_1_conv_2_batchnorm.running_mean", "base_model.mixed_6_tower_1_conv_2_batchnorm.running_var", "base_model.mixed_6_tower_1_conv_3_Conv2D.weight", "base_model.mixed_6_tower_1_conv_3_Conv2D.bias", "base_model.mixed_6_tower_1_conv_3_batchnorm.weight", "base_model.mixed_6_tower_1_conv_3_batchnorm.bias", "base_model.mixed_6_tower_1_conv_3_batchnorm.running_mean", "base_model.mixed_6_tower_1_conv_3_batchnorm.running_var", "base_model.mixed_6_tower_1_conv_4_Conv2D.weight", "base_model.mixed_6_tower_1_conv_4_Conv2D.bias", "base_model.mixed_6_tower_1_conv_4_batchnorm.weight", "base_model.mixed_6_tower_1_conv_4_batchnorm.bias", "base_model.mixed_6_tower_1_conv_4_batchnorm.running_mean", 
"base_model.mixed_6_tower_1_conv_4_batchnorm.running_var", "base_model.mixed_6_tower_2_conv_Conv2D.weight", "base_model.mixed_6_tower_2_conv_Conv2D.bias", "base_model.mixed_6_tower_2_conv_batchnorm.weight", "base_model.mixed_6_tower_2_conv_batchnorm.bias", "base_model.mixed_6_tower_2_conv_batchnorm.running_mean", "base_model.mixed_6_tower_2_conv_batchnorm.running_var", "base_model.mixed_7_conv_Conv2D.weight", "base_model.mixed_7_conv_Conv2D.bias", "base_model.mixed_7_conv_batchnorm.weight", "base_model.mixed_7_conv_batchnorm.bias", "base_model.mixed_7_conv_batchnorm.running_mean", "base_model.mixed_7_conv_batchnorm.running_var", "base_model.mixed_7_tower_conv_Conv2D.weight", "base_model.mixed_7_tower_conv_Conv2D.bias", "base_model.mixed_7_tower_conv_batchnorm.weight", "base_model.mixed_7_tower_conv_batchnorm.bias", "base_model.mixed_7_tower_conv_batchnorm.running_mean", "base_model.mixed_7_tower_conv_batchnorm.running_var", "base_model.mixed_7_tower_conv_1_Conv2D.weight", "base_model.mixed_7_tower_conv_1_Conv2D.bias", "base_model.mixed_7_tower_conv_1_batchnorm.weight", "base_model.mixed_7_tower_conv_1_batchnorm.bias", "base_model.mixed_7_tower_conv_1_batchnorm.running_mean", "base_model.mixed_7_tower_conv_1_batchnorm.running_var", "base_model.mixed_7_tower_conv_2_Conv2D.weight", "base_model.mixed_7_tower_conv_2_Conv2D.bias", "base_model.mixed_7_tower_conv_2_batchnorm.weight", "base_model.mixed_7_tower_conv_2_batchnorm.bias", "base_model.mixed_7_tower_conv_2_batchnorm.running_mean", "base_model.mixed_7_tower_conv_2_batchnorm.running_var", "base_model.mixed_7_tower_1_conv_Conv2D.weight", "base_model.mixed_7_tower_1_conv_Conv2D.bias", "base_model.mixed_7_tower_1_conv_batchnorm.weight", "base_model.mixed_7_tower_1_conv_batchnorm.bias", "base_model.mixed_7_tower_1_conv_batchnorm.running_mean", "base_model.mixed_7_tower_1_conv_batchnorm.running_var", "base_model.mixed_7_tower_1_conv_1_Conv2D.weight", "base_model.mixed_7_tower_1_conv_1_Conv2D.bias", 
"base_model.mixed_7_tower_1_conv_1_batchnorm.weight", "base_model.mixed_7_tower_1_conv_1_batchnorm.bias", "base_model.mixed_7_tower_1_conv_1_batchnorm.running_mean", "base_model.mixed_7_tower_1_conv_1_batchnorm.running_var", "base_model.mixed_7_tower_1_conv_2_Conv2D.weight", "base_model.mixed_7_tower_1_conv_2_Conv2D.bias", "base_model.mixed_7_tower_1_conv_2_batchnorm.weight", "base_model.mixed_7_tower_1_conv_2_batchnorm.bias", "base_model.mixed_7_tower_1_conv_2_batchnorm.running_mean", "base_model.mixed_7_tower_1_conv_2_batchnorm.running_var", "base_model.mixed_7_tower_1_conv_3_Conv2D.weight", "base_model.mixed_7_tower_1_conv_3_Conv2D.bias", "base_model.mixed_7_tower_1_conv_3_batchnorm.weight", "base_model.mixed_7_tower_1_conv_3_batchnorm.bias", "base_model.mixed_7_tower_1_conv_3_batchnorm.running_mean", "base_model.mixed_7_tower_1_conv_3_batchnorm.running_var", "base_model.mixed_7_tower_1_conv_4_Conv2D.weight", "base_model.mixed_7_tower_1_conv_4_Conv2D.bias", "base_model.mixed_7_tower_1_conv_4_batchnorm.weight", "base_model.mixed_7_tower_1_conv_4_batchnorm.bias", "base_model.mixed_7_tower_1_conv_4_batchnorm.running_mean", "base_model.mixed_7_tower_1_conv_4_batchnorm.running_var", "base_model.mixed_7_tower_2_conv_Conv2D.weight", "base_model.mixed_7_tower_2_conv_Conv2D.bias", "base_model.mixed_7_tower_2_conv_batchnorm.weight", "base_model.mixed_7_tower_2_conv_batchnorm.bias", "base_model.mixed_7_tower_2_conv_batchnorm.running_mean", "base_model.mixed_7_tower_2_conv_batchnorm.running_var", "base_model.mixed_8_tower_conv_Conv2D.weight", "base_model.mixed_8_tower_conv_Conv2D.bias", "base_model.mixed_8_tower_conv_batchnorm.weight", "base_model.mixed_8_tower_conv_batchnorm.bias", "base_model.mixed_8_tower_conv_batchnorm.running_mean", "base_model.mixed_8_tower_conv_batchnorm.running_var", "base_model.mixed_8_tower_conv_1_Conv2D.weight", "base_model.mixed_8_tower_conv_1_Conv2D.bias", "base_model.mixed_8_tower_conv_1_batchnorm.weight", 
"base_model.mixed_8_tower_conv_1_batchnorm.bias", "base_model.mixed_8_tower_conv_1_batchnorm.running_mean", "base_model.mixed_8_tower_conv_1_batchnorm.running_var", "base_model.mixed_8_tower_1_conv_Conv2D.weight", "base_model.mixed_8_tower_1_conv_Conv2D.bias", "base_model.mixed_8_tower_1_conv_batchnorm.weight", "base_model.mixed_8_tower_1_conv_batchnorm.bias", "base_model.mixed_8_tower_1_conv_batchnorm.running_mean", "base_model.mixed_8_tower_1_conv_batchnorm.running_var", "base_model.mixed_8_tower_1_conv_1_Conv2D.weight", "base_model.mixed_8_tower_1_conv_1_Conv2D.bias", "base_model.mixed_8_tower_1_conv_1_batchnorm.weight", "base_model.mixed_8_tower_1_conv_1_batchnorm.bias", "base_model.mixed_8_tower_1_conv_1_batchnorm.running_mean", "base_model.mixed_8_tower_1_conv_1_batchnorm.running_var", "base_model.mixed_8_tower_1_conv_2_Conv2D.weight", "base_model.mixed_8_tower_1_conv_2_Conv2D.bias", "base_model.mixed_8_tower_1_conv_2_batchnorm.weight", "base_model.mixed_8_tower_1_conv_2_batchnorm.bias", "base_model.mixed_8_tower_1_conv_2_batchnorm.running_mean", "base_model.mixed_8_tower_1_conv_2_batchnorm.running_var", "base_model.mixed_8_tower_1_conv_3_Conv2D.weight", "base_model.mixed_8_tower_1_conv_3_Conv2D.bias", "base_model.mixed_8_tower_1_conv_3_batchnorm.weight", "base_model.mixed_8_tower_1_conv_3_batchnorm.bias", "base_model.mixed_8_tower_1_conv_3_batchnorm.running_mean", "base_model.mixed_8_tower_1_conv_3_batchnorm.running_var", "base_model.mixed_9_conv_Conv2D.weight", "base_model.mixed_9_conv_Conv2D.bias", "base_model.mixed_9_conv_batchnorm.weight", "base_model.mixed_9_conv_batchnorm.bias", "base_model.mixed_9_conv_batchnorm.running_mean", "base_model.mixed_9_conv_batchnorm.running_var", "base_model.mixed_9_tower_conv_Conv2D.weight", "base_model.mixed_9_tower_conv_Conv2D.bias", "base_model.mixed_9_tower_conv_batchnorm.weight", "base_model.mixed_9_tower_conv_batchnorm.bias", "base_model.mixed_9_tower_conv_batchnorm.running_mean", 
"base_model.mixed_9_tower_conv_batchnorm.running_var", "base_model.mixed_9_tower_mixed_conv_Conv2D.weight", "base_model.mixed_9_tower_mixed_conv_Conv2D.bias", "base_model.mixed_9_tower_mixed_conv_batchnorm.weight", "base_model.mixed_9_tower_mixed_conv_batchnorm.bias", "base_model.mixed_9_tower_mixed_conv_batchnorm.running_mean", "base_model.mixed_9_tower_mixed_conv_batchnorm.running_var", "base_model.mixed_9_tower_mixed_conv_1_Conv2D.weight", "base_model.mixed_9_tower_mixed_conv_1_Conv2D.bias", "base_model.mixed_9_tower_mixed_conv_1_batchnorm.weight", "base_model.mixed_9_tower_mixed_conv_1_batchnorm.bias", "base_model.mixed_9_tower_mixed_conv_1_batchnorm.running_mean", "base_model.mixed_9_tower_mixed_conv_1_batchnorm.running_var", "base_model.mixed_9_tower_1_conv_Conv2D.weight", "base_model.mixed_9_tower_1_conv_Conv2D.bias", "base_model.mixed_9_tower_1_conv_batchnorm.weight", "base_model.mixed_9_tower_1_conv_batchnorm.bias", "base_model.mixed_9_tower_1_conv_batchnorm.running_mean", "base_model.mixed_9_tower_1_conv_batchnorm.running_var", "base_model.mixed_9_tower_1_conv_1_Conv2D.weight", "base_model.mixed_9_tower_1_conv_1_Conv2D.bias", "base_model.mixed_9_tower_1_conv_1_batchnorm.weight", "base_model.mixed_9_tower_1_conv_1_batchnorm.bias", "base_model.mixed_9_tower_1_conv_1_batchnorm.running_mean", "base_model.mixed_9_tower_1_conv_1_batchnorm.running_var", "base_model.mixed_9_tower_1_mixed_conv_Conv2D.weight", "base_model.mixed_9_tower_1_mixed_conv_Conv2D.bias", "base_model.mixed_9_tower_1_mixed_conv_batchnorm.weight", "base_model.mixed_9_tower_1_mixed_conv_batchnorm.bias", "base_model.mixed_9_tower_1_mixed_conv_batchnorm.running_mean", "base_model.mixed_9_tower_1_mixed_conv_batchnorm.running_var", "base_model.mixed_9_tower_1_mixed_conv_1_Conv2D.weight", "base_model.mixed_9_tower_1_mixed_conv_1_Conv2D.bias", "base_model.mixed_9_tower_1_mixed_conv_1_batchnorm.weight", "base_model.mixed_9_tower_1_mixed_conv_1_batchnorm.bias", 
"base_model.mixed_9_tower_1_mixed_conv_1_batchnorm.running_mean", "base_model.mixed_9_tower_1_mixed_conv_1_batchnorm.running_var", "base_model.mixed_9_tower_2_conv_Conv2D.weight", "base_model.mixed_9_tower_2_conv_Conv2D.bias", "base_model.mixed_9_tower_2_conv_batchnorm.weight", "base_model.mixed_9_tower_2_conv_batchnorm.bias", "base_model.mixed_9_tower_2_conv_batchnorm.running_mean", "base_model.mixed_9_tower_2_conv_batchnorm.running_var", "base_model.mixed_10_conv_Conv2D.weight", "base_model.mixed_10_conv_Conv2D.bias", "base_model.mixed_10_conv_batchnorm.weight", "base_model.mixed_10_conv_batchnorm.bias", "base_model.mixed_10_conv_batchnorm.running_mean", "base_model.mixed_10_conv_batchnorm.running_var", "base_model.mixed_10_tower_conv_Conv2D.weight", "base_model.mixed_10_tower_conv_Conv2D.bias", "base_model.mixed_10_tower_conv_batchnorm.weight", "base_model.mixed_10_tower_conv_batchnorm.bias", "base_model.mixed_10_tower_conv_batchnorm.running_mean", "base_model.mixed_10_tower_conv_batchnorm.running_var", "base_model.mixed_10_tower_mixed_conv_Conv2D.weight", "base_model.mixed_10_tower_mixed_conv_Conv2D.bias", "base_model.mixed_10_tower_mixed_conv_batchnorm.weight", "base_model.mixed_10_tower_mixed_conv_batchnorm.bias", "base_model.mixed_10_tower_mixed_conv_batchnorm.running_mean", "base_model.mixed_10_tower_mixed_conv_batchnorm.running_var", "base_model.mixed_10_tower_mixed_conv_1_Conv2D.weight", "base_model.mixed_10_tower_mixed_conv_1_Conv2D.bias", "base_model.mixed_10_tower_mixed_conv_1_batchnorm.weight", "base_model.mixed_10_tower_mixed_conv_1_batchnorm.bias", "base_model.mixed_10_tower_mixed_conv_1_batchnorm.running_mean", "base_model.mixed_10_tower_mixed_conv_1_batchnorm.running_var", "base_model.mixed_10_tower_1_conv_Conv2D.weight", "base_model.mixed_10_tower_1_conv_Conv2D.bias", "base_model.mixed_10_tower_1_conv_batchnorm.weight", "base_model.mixed_10_tower_1_conv_batchnorm.bias", "base_model.mixed_10_tower_1_conv_batchnorm.running_mean", 
"base_model.mixed_10_tower_1_conv_batchnorm.running_var", "base_model.mixed_10_tower_1_conv_1_Conv2D.weight", "base_model.mixed_10_tower_1_conv_1_Conv2D.bias", "base_model.mixed_10_tower_1_conv_1_batchnorm.weight", "base_model.mixed_10_tower_1_conv_1_batchnorm.bias", "base_model.mixed_10_tower_1_conv_1_batchnorm.running_mean", "base_model.mixed_10_tower_1_conv_1_batchnorm.running_var", "base_model.mixed_10_tower_1_mixed_conv_Conv2D.weight", "base_model.mixed_10_tower_1_mixed_conv_Conv2D.bias", "base_model.mixed_10_tower_1_mixed_conv_batchnorm.weight", "base_model.mixed_10_tower_1_mixed_conv_batchnorm.bias", "base_model.mixed_10_tower_1_mixed_conv_batchnorm.running_mean", "base_model.mixed_10_tower_1_mixed_conv_batchnorm.running_var", "base_model.mixed_10_tower_1_mixed_conv_1_Conv2D.weight", "base_model.mixed_10_tower_1_mixed_conv_1_Conv2D.bias", "base_model.mixed_10_tower_1_mixed_conv_1_batchnorm.weight", "base_model.mixed_10_tower_1_mixed_conv_1_batchnorm.bias", "base_model.mixed_10_tower_1_mixed_conv_1_batchnorm.running_mean", "base_model.mixed_10_tower_1_mixed_conv_1_batchnorm.running_var", "base_model.mixed_10_tower_2_conv_Conv2D.weight", "base_model.mixed_10_tower_2_conv_Conv2D.bias", "base_model.mixed_10_tower_2_conv_batchnorm.weight", "base_model.mixed_10_tower_2_conv_batchnorm.bias", "base_model.mixed_10_tower_2_conv_batchnorm.running_mean", "base_model.mixed_10_tower_2_conv_batchnorm.running_var".
Unexpected key(s) in state_dict: "base_model.conv1_7x7_s2.weight", "base_model.conv1_7x7_s2.bias", "base_model.conv1_7x7_s2_bn.weight", "base_model.conv1_7x7_s2_bn.bias", "base_model.conv1_7x7_s2_bn.running_mean", "base_model.conv1_7x7_s2_bn.running_var", "base_model.conv2_3x3_reduce.weight", "base_model.conv2_3x3_reduce.bias", "base_model.conv2_3x3_reduce_bn.weight", "base_model.conv2_3x3_reduce_bn.bias", "base_model.conv2_3x3_reduce_bn.running_mean", "base_model.conv2_3x3_reduce_bn.running_var", "base_model.conv2_3x3.weight", "base_model.conv2_3x3.bias", "base_model.conv2_3x3_bn.weight", "base_model.conv2_3x3_bn.bias", "base_model.conv2_3x3_bn.running_mean", "base_model.conv2_3x3_bn.running_var", "base_model.inception_3a_1x1.weight", "base_model.inception_3a_1x1.bias", "base_model.inception_3a_1x1_bn.weight", "base_model.inception_3a_1x1_bn.bias", "base_model.inception_3a_1x1_bn.running_mean", "base_model.inception_3a_1x1_bn.running_var", "base_model.inception_3a_3x3_reduce.weight", "base_model.inception_3a_3x3_reduce.bias", "base_model.inception_3a_3x3_reduce_bn.weight", "base_model.inception_3a_3x3_reduce_bn.bias", "base_model.inception_3a_3x3_reduce_bn.running_mean", "base_model.inception_3a_3x3_reduce_bn.running_var", "base_model.inception_3a_3x3.weight", "base_model.inception_3a_3x3.bias", "base_model.inception_3a_3x3_bn.weight", "base_model.inception_3a_3x3_bn.bias", "base_model.inception_3a_3x3_bn.running_mean", "base_model.inception_3a_3x3_bn.running_var", "base_model.inception_3a_double_3x3_reduce.weight", "base_model.inception_3a_double_3x3_reduce.bias", "base_model.inception_3a_double_3x3_reduce_bn.weight", "base_model.inception_3a_double_3x3_reduce_bn.bias", "base_model.inception_3a_double_3x3_reduce_bn.running_mean", "base_model.inception_3a_double_3x3_reduce_bn.running_var", "base_model.inception_3a_double_3x3_1.weight", "base_model.inception_3a_double_3x3_1.bias", "base_model.inception_3a_double_3x3_1_bn.weight", 
"base_model.inception_3a_double_3x3_1_bn.bias", "base_model.inception_3a_double_3x3_1_bn.running_mean", "base_model.inception_3a_double_3x3_1_bn.running_var", "base_model.inception_3a_double_3x3_2.weight", "base_model.inception_3a_double_3x3_2.bias", "base_model.inception_3a_double_3x3_2_bn.weight", "base_model.inception_3a_double_3x3_2_bn.bias", "base_model.inception_3a_double_3x3_2_bn.running_mean", "base_model.inception_3a_double_3x3_2_bn.running_var", "base_model.inception_3a_pool_proj.weight", "base_model.inception_3a_pool_proj.bias", "base_model.inception_3a_pool_proj_bn.weight", "base_model.inception_3a_pool_proj_bn.bias", "base_model.inception_3a_pool_proj_bn.running_mean", "base_model.inception_3a_pool_proj_bn.running_var", "base_model.inception_3b_1x1.weight", "base_model.inception_3b_1x1.bias", "base_model.inception_3b_1x1_bn.weight", "base_model.inception_3b_1x1_bn.bias", "base_model.inception_3b_1x1_bn.running_mean", "base_model.inception_3b_1x1_bn.running_var", "base_model.inception_3b_3x3_reduce.weight", "base_model.inception_3b_3x3_reduce.bias", "base_model.inception_3b_3x3_reduce_bn.weight", "base_model.inception_3b_3x3_reduce_bn.bias", "base_model.inception_3b_3x3_reduce_bn.running_mean", "base_model.inception_3b_3x3_reduce_bn.running_var", "base_model.inception_3b_3x3.weight", "base_model.inception_3b_3x3.bias", "base_model.inception_3b_3x3_bn.weight", "base_model.inception_3b_3x3_bn.bias", "base_model.inception_3b_3x3_bn.running_mean", "base_model.inception_3b_3x3_bn.running_var", "base_model.inception_3b_double_3x3_reduce.weight", "base_model.inception_3b_double_3x3_reduce.bias", "base_model.inception_3b_double_3x3_reduce_bn.weight", "base_model.inception_3b_double_3x3_reduce_bn.bias", "base_model.inception_3b_double_3x3_reduce_bn.running_mean", "base_model.inception_3b_double_3x3_reduce_bn.running_var", "base_model.inception_3b_double_3x3_1.weight", "base_model.inception_3b_double_3x3_1.bias", "base_model.inception_3b_double_3x3_1_bn.weight", 
"base_model.inception_3b_double_3x3_1_bn.bias", "base_model.inception_3b_double_3x3_1_bn.running_mean", "base_model.inception_3b_double_3x3_1_bn.running_var", "base_model.inception_3b_double_3x3_2.weight", "base_model.inception_3b_double_3x3_2.bias", "base_model.inception_3b_double_3x3_2_bn.weight", "base_model.inception_3b_double_3x3_2_bn.bias", "base_model.inception_3b_double_3x3_2_bn.running_mean", "base_model.inception_3b_double_3x3_2_bn.running_var", "base_model.inception_3b_pool_proj.weight", "base_model.inception_3b_pool_proj.bias", "base_model.inception_3b_pool_proj_bn.weight", "base_model.inception_3b_pool_proj_bn.bias", "base_model.inception_3b_pool_proj_bn.running_mean", "base_model.inception_3b_pool_proj_bn.running_var", "base_model.inception_3c_3x3_reduce.weight", "base_model.inception_3c_3x3_reduce.bias", "base_model.inception_3c_3x3_reduce_bn.weight", "base_model.inception_3c_3x3_reduce_bn.bias", "base_model.inception_3c_3x3_reduce_bn.running_mean", "base_model.inception_3c_3x3_reduce_bn.running_var", "base_model.inception_3c_3x3.weight", "base_model.inception_3c_3x3.bias", "base_model.inception_3c_3x3_bn.weight", "base_model.inception_3c_3x3_bn.bias", "base_model.inception_3c_3x3_bn.running_mean", "base_model.inception_3c_3x3_bn.running_var", "base_model.inception_3c_double_3x3_reduce.weight", "base_model.inception_3c_double_3x3_reduce.bias", "base_model.inception_3c_double_3x3_reduce_bn.weight", "base_model.inception_3c_double_3x3_reduce_bn.bias", "base_model.inception_3c_double_3x3_reduce_bn.running_mean", "base_model.inception_3c_double_3x3_reduce_bn.running_var", "base_model.inception_3c_double_3x3_1.weight", "base_model.inception_3c_double_3x3_1.bias", "base_model.inception_3c_double_3x3_1_bn.weight", "base_model.inception_3c_double_3x3_1_bn.bias", "base_model.inception_3c_double_3x3_1_bn.running_mean", "base_model.inception_3c_double_3x3_1_bn.running_var", "base_model.inception_3c_double_3x3_2.weight", 
"base_model.inception_3c_double_3x3_2.bias", "base_model.inception_3c_double_3x3_2_bn.weight", "base_model.inception_3c_double_3x3_2_bn.bias", "base_model.inception_3c_double_3x3_2_bn.running_mean", "base_model.inception_3c_double_3x3_2_bn.running_var", "base_model.inception_4a_1x1.weight", "base_model.inception_4a_1x1.bias", "base_model.inception_4a_1x1_bn.weight", "base_model.inception_4a_1x1_bn.bias", "base_model.inception_4a_1x1_bn.running_mean", "base_model.inception_4a_1x1_bn.running_var", "base_model.inception_4a_3x3_reduce.weight", "base_model.inception_4a_3x3_reduce.bias", "base_model.inception_4a_3x3_reduce_bn.weight", "base_model.inception_4a_3x3_reduce_bn.bias", "base_model.inception_4a_3x3_reduce_bn.running_mean", "base_model.inception_4a_3x3_reduce_bn.running_var", "base_model.inception_4a_3x3.weight", "base_model.inception_4a_3x3.bias", "base_model.inception_4a_3x3_bn.weight", "base_model.inception_4a_3x3_bn.bias", "base_model.inception_4a_3x3_bn.running_mean", "base_model.inception_4a_3x3_bn.running_var", "base_model.inception_4a_double_3x3_reduce.weight", "base_model.inception_4a_double_3x3_reduce.bias", "base_model.inception_4a_double_3x3_reduce_bn.weight", "base_model.inception_4a_double_3x3_reduce_bn.bias", "base_model.inception_4a_double_3x3_reduce_bn.running_mean", "base_model.inception_4a_double_3x3_reduce_bn.running_var", "base_model.inception_4a_double_3x3_1.weight", "base_model.inception_4a_double_3x3_1.bias", "base_model.inception_4a_double_3x3_1_bn.weight", "base_model.inception_4a_double_3x3_1_bn.bias", "base_model.inception_4a_double_3x3_1_bn.running_mean", "base_model.inception_4a_double_3x3_1_bn.running_var", "base_model.inception_4a_double_3x3_2.weight", "base_model.inception_4a_double_3x3_2.bias", "base_model.inception_4a_double_3x3_2_bn.weight", "base_model.inception_4a_double_3x3_2_bn.bias", "base_model.inception_4a_double_3x3_2_bn.running_mean", "base_model.inception_4a_double_3x3_2_bn.running_var", 
"base_model.inception_4a_pool_proj.weight", "base_model.inception_4a_pool_proj.bias", "base_model.inception_4a_pool_proj_bn.weight", "base_model.inception_4a_pool_proj_bn.bias", "base_model.inception_4a_pool_proj_bn.running_mean", "base_model.inception_4a_pool_proj_bn.running_var", "base_model.inception_4b_1x1.weight", "base_model.inception_4b_1x1.bias", "base_model.inception_4b_1x1_bn.weight", "base_model.inception_4b_1x1_bn.bias", "base_model.inception_4b_1x1_bn.running_mean", "base_model.inception_4b_1x1_bn.running_var", "base_model.inception_4b_3x3_reduce.weight", "base_model.inception_4b_3x3_reduce.bias", "base_model.inception_4b_3x3_reduce_bn.weight", "base_model.inception_4b_3x3_reduce_bn.bias", "base_model.inception_4b_3x3_reduce_bn.running_mean", "base_model.inception_4b_3x3_reduce_bn.running_var", "base_model.inception_4b_3x3.weight", "base_model.inception_4b_3x3.bias", "base_model.inception_4b_3x3_bn.weight", "base_model.inception_4b_3x3_bn.bias", "base_model.inception_4b_3x3_bn.running_mean", "base_model.inception_4b_3x3_bn.running_var", "base_model.inception_4b_double_3x3_reduce.weight", "base_model.inception_4b_double_3x3_reduce.bias", "base_model.inception_4b_double_3x3_reduce_bn.weight", "base_model.inception_4b_double_3x3_reduce_bn.bias", "base_model.inception_4b_double_3x3_reduce_bn.running_mean", "base_model.inception_4b_double_3x3_reduce_bn.running_var", "base_model.inception_4b_double_3x3_1.weight", "base_model.inception_4b_double_3x3_1.bias", "base_model.inception_4b_double_3x3_1_bn.weight", "base_model.inception_4b_double_3x3_1_bn.bias", "base_model.inception_4b_double_3x3_1_bn.running_mean", "base_model.inception_4b_double_3x3_1_bn.running_var", "base_model.inception_4b_double_3x3_2.weight", "base_model.inception_4b_double_3x3_2.bias", "base_model.inception_4b_double_3x3_2_bn.weight", "base_model.inception_4b_double_3x3_2_bn.bias", "base_model.inception_4b_double_3x3_2_bn.running_mean", "base_model.inception_4b_double_3x3_2_bn.running_var", 
"base_model.inception_4b_pool_proj.weight", "base_model.inception_4b_pool_proj.bias", "base_model.inception_4b_pool_proj_bn.weight", "base_model.inception_4b_pool_proj_bn.bias", "base_model.inception_4b_pool_proj_bn.running_mean", "base_model.inception_4b_pool_proj_bn.running_var", "base_model.inception_4c_1x1.weight", "base_model.inception_4c_1x1.bias", "base_model.inception_4c_1x1_bn.weight", "base_model.inception_4c_1x1_bn.bias", "base_model.inception_4c_1x1_bn.running_mean", "base_model.inception_4c_1x1_bn.running_var", "base_model.inception_4c_3x3_reduce.weight", "base_model.inception_4c_3x3_reduce.bias", "base_model.inception_4c_3x3_reduce_bn.weight", "base_model.inception_4c_3x3_reduce_bn.bias", "base_model.inception_4c_3x3_reduce_bn.running_mean", "base_model.inception_4c_3x3_reduce_bn.running_var", "base_model.inception_4c_3x3.weight", "base_model.inception_4c_3x3.bias", "base_model.inception_4c_3x3_bn.weight", "base_model.inception_4c_3x3_bn.bias", "base_model.inception_4c_3x3_bn.running_mean", "base_model.inception_4c_3x3_bn.running_var", "base_model.inception_4c_double_3x3_reduce.weight", "base_model.inception_4c_double_3x3_reduce.bias", "base_model.inception_4c_double_3x3_reduce_bn.weight", "base_model.inception_4c_double_3x3_reduce_bn.bias", "base_model.inception_4c_double_3x3_reduce_bn.running_mean", "base_model.inception_4c_double_3x3_reduce_bn.running_var", "base_model.inception_4c_double_3x3_1.weight", "base_model.inception_4c_double_3x3_1.bias", "base_model.inception_4c_double_3x3_1_bn.weight", "base_model.inception_4c_double_3x3_1_bn.bias", "base_model.inception_4c_double_3x3_1_bn.running_mean", "base_model.inception_4c_double_3x3_1_bn.running_var", "base_model.inception_4c_double_3x3_2.weight", "base_model.inception_4c_double_3x3_2.bias", "base_model.inception_4c_double_3x3_2_bn.weight", "base_model.inception_4c_double_3x3_2_bn.bias", "base_model.inception_4c_double_3x3_2_bn.running_mean", "base_model.inception_4c_double_3x3_2_bn.running_var", 
"base_model.inception_4c_pool_proj.weight", "base_model.inception_4c_pool_proj.bias", "base_model.inception_4c_pool_proj_bn.weight", "base_model.inception_4c_pool_proj_bn.bias", "base_model.inception_4c_pool_proj_bn.running_mean", "base_model.inception_4c_pool_proj_bn.running_var", "base_model.inception_4d_1x1.weight", "base_model.inception_4d_1x1.bias", "base_model.inception_4d_1x1_bn.weight", "base_model.inception_4d_1x1_bn.bias", "base_model.inception_4d_1x1_bn.running_mean", "base_model.inception_4d_1x1_bn.running_var", "base_model.inception_4d_3x3_reduce.weight", "base_model.inception_4d_3x3_reduce.bias", "base_model.inception_4d_3x3_reduce_bn.weight", "base_model.inception_4d_3x3_reduce_bn.bias", "base_model.inception_4d_3x3_reduce_bn.running_mean", "base_model.inception_4d_3x3_reduce_bn.running_var", "base_model.inception_4d_3x3.weight", "base_model.inception_4d_3x3.bias", "base_model.inception_4d_3x3_bn.weight", "base_model.inception_4d_3x3_bn.bias", "base_model.inception_4d_3x3_bn.running_mean", "base_model.inception_4d_3x3_bn.running_var", "base_model.inception_4d_double_3x3_reduce.weight", "base_model.inception_4d_double_3x3_reduce.bias", "base_model.inception_4d_double_3x3_reduce_bn.weight", "base_model.inception_4d_double_3x3_reduce_bn.bias", "base_model.inception_4d_double_3x3_reduce_bn.running_mean", "base_model.inception_4d_double_3x3_reduce_bn.running_var", "base_model.inception_4d_double_3x3_1.weight", "base_model.inception_4d_double_3x3_1.bias", "base_model.inception_4d_double_3x3_1_bn.weight", "base_model.inception_4d_double_3x3_1_bn.bias", "base_model.inception_4d_double_3x3_1_bn.running_mean", "base_model.inception_4d_double_3x3_1_bn.running_var", "base_model.inception_4d_double_3x3_2.weight", "base_model.inception_4d_double_3x3_2.bias", "base_model.inception_4d_double_3x3_2_bn.weight", "base_model.inception_4d_double_3x3_2_bn.bias", "base_model.inception_4d_double_3x3_2_bn.running_mean", "base_model.inception_4d_double_3x3_2_bn.running_var", 
"base_model.inception_4d_pool_proj.weight", "base_model.inception_4d_pool_proj.bias", "base_model.inception_4d_pool_proj_bn.weight", "base_model.inception_4d_pool_proj_bn.bias", "base_model.inception_4d_pool_proj_bn.running_mean", "base_model.inception_4d_pool_proj_bn.running_var", "base_model.inception_4e_3x3_reduce.weight", "base_model.inception_4e_3x3_reduce.bias", "base_model.inception_4e_3x3_reduce_bn.weight", "base_model.inception_4e_3x3_reduce_bn.bias", "base_model.inception_4e_3x3_reduce_bn.running_mean", "base_model.inception_4e_3x3_reduce_bn.running_var", "base_model.inception_4e_3x3.weight", "base_model.inception_4e_3x3.bias", "base_model.inception_4e_3x3_bn.weight", "base_model.inception_4e_3x3_bn.bias", "base_model.inception_4e_3x3_bn.running_mean", "base_model.inception_4e_3x3_bn.running_var", "base_model.inception_4e_double_3x3_reduce.weight", "base_model.inception_4e_double_3x3_reduce.bias", "base_model.inception_4e_double_3x3_reduce_bn.weight", "base_model.inception_4e_double_3x3_reduce_bn.bias", "base_model.inception_4e_double_3x3_reduce_bn.running_mean", "base_model.inception_4e_double_3x3_reduce_bn.running_var", "base_model.inception_4e_double_3x3_1.weight", "base_model.inception_4e_double_3x3_1.bias", "base_model.inception_4e_double_3x3_1_bn.weight", "base_model.inception_4e_double_3x3_1_bn.bias", "base_model.inception_4e_double_3x3_1_bn.running_mean", "base_model.inception_4e_double_3x3_1_bn.running_var", "base_model.inception_4e_double_3x3_2.weight", "base_model.inception_4e_double_3x3_2.bias", "base_model.inception_4e_double_3x3_2_bn.weight", "base_model.inception_4e_double_3x3_2_bn.bias", "base_model.inception_4e_double_3x3_2_bn.running_mean", "base_model.inception_4e_double_3x3_2_bn.running_var", "base_model.inception_5a_1x1.weight", "base_model.inception_5a_1x1.bias", "base_model.inception_5a_1x1_bn.weight", "base_model.inception_5a_1x1_bn.bias", "base_model.inception_5a_1x1_bn.running_mean", "base_model.inception_5a_1x1_bn.running_var", 
"base_model.inception_5a_3x3_reduce.weight", "base_model.inception_5a_3x3_reduce.bias", "base_model.inception_5a_3x3_reduce_bn.weight", "base_model.inception_5a_3x3_reduce_bn.bias", "base_model.inception_5a_3x3_reduce_bn.running_mean", "base_model.inception_5a_3x3_reduce_bn.running_var", "base_model.inception_5a_3x3.weight", "base_model.inception_5a_3x3.bias", "base_model.inception_5a_3x3_bn.weight", "base_model.inception_5a_3x3_bn.bias", "base_model.inception_5a_3x3_bn.running_mean", "base_model.inception_5a_3x3_bn.running_var", "base_model.inception_5a_double_3x3_reduce.weight", "base_model.inception_5a_double_3x3_reduce.bias", "base_model.inception_5a_double_3x3_reduce_bn.weight", "base_model.inception_5a_double_3x3_reduce_bn.bias", "base_model.inception_5a_double_3x3_reduce_bn.running_mean", "base_model.inception_5a_double_3x3_reduce_bn.running_var", "base_model.inception_5a_double_3x3_1.weight", "base_model.inception_5a_double_3x3_1.bias", "base_model.inception_5a_double_3x3_1_bn.weight", "base_model.inception_5a_double_3x3_1_bn.bias", "base_model.inception_5a_double_3x3_1_bn.running_mean", "base_model.inception_5a_double_3x3_1_bn.running_var", "base_model.inception_5a_double_3x3_2.weight", "base_model.inception_5a_double_3x3_2.bias", "base_model.inception_5a_double_3x3_2_bn.weight", "base_model.inception_5a_double_3x3_2_bn.bias", "base_model.inception_5a_double_3x3_2_bn.running_mean", "base_model.inception_5a_double_3x3_2_bn.running_var", "base_model.inception_5a_pool_proj.weight", "base_model.inception_5a_pool_proj.bias", "base_model.inception_5a_pool_proj_bn.weight", "base_model.inception_5a_pool_proj_bn.bias", "base_model.inception_5a_pool_proj_bn.running_mean", "base_model.inception_5a_pool_proj_bn.running_var", "base_model.inception_5b_1x1.weight", "base_model.inception_5b_1x1.bias", "base_model.inception_5b_1x1_bn.weight", "base_model.inception_5b_1x1_bn.bias", "base_model.inception_5b_1x1_bn.running_mean", "base_model.inception_5b_1x1_bn.running_var", 
"base_model.inception_5b_3x3_reduce.weight", "base_model.inception_5b_3x3_reduce.bias", "base_model.inception_5b_3x3_reduce_bn.weight", "base_model.inception_5b_3x3_reduce_bn.bias", "base_model.inception_5b_3x3_reduce_bn.running_mean", "base_model.inception_5b_3x3_reduce_bn.running_var", "base_model.inception_5b_3x3.weight", "base_model.inception_5b_3x3.bias", "base_model.inception_5b_3x3_bn.weight", "base_model.inception_5b_3x3_bn.bias", "base_model.inception_5b_3x3_bn.running_mean", "base_model.inception_5b_3x3_bn.running_var", "base_model.inception_5b_double_3x3_reduce.weight", "base_model.inception_5b_double_3x3_reduce.bias", "base_model.inception_5b_double_3x3_reduce_bn.weight", "base_model.inception_5b_double_3x3_reduce_bn.bias", "base_model.inception_5b_double_3x3_reduce_bn.running_mean", "base_model.inception_5b_double_3x3_reduce_bn.running_var", "base_model.inception_5b_double_3x3_1.weight", "base_model.inception_5b_double_3x3_1.bias", "base_model.inception_5b_double_3x3_1_bn.weight", "base_model.inception_5b_double_3x3_1_bn.bias", "base_model.inception_5b_double_3x3_1_bn.running_mean", "base_model.inception_5b_double_3x3_1_bn.running_var", "base_model.inception_5b_double_3x3_2.weight", "base_model.inception_5b_double_3x3_2.bias", "base_model.inception_5b_double_3x3_2_bn.weight", "base_model.inception_5b_double_3x3_2_bn.bias", "base_model.inception_5b_double_3x3_2_bn.running_mean", "base_model.inception_5b_double_3x3_2_bn.running_var", "base_model.inception_5b_pool_proj.weight", "base_model.inception_5b_pool_proj.bias", "base_model.inception_5b_pool_proj_bn.weight", "base_model.inception_5b_pool_proj_bn.bias", "base_model.inception_5b_pool_proj_bn.running_mean", "base_model.inception_5b_pool_proj_bn.running_var".
While copying the parameter named "new_fc.weight", whose dimensions in the model are torch.Size([256, 2048]) and whose dimensions in the checkpoint are torch.Size([256, 1024]).
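
A possible way to narrow this down, sketched under assumptions: the missing keys (`conv_Conv2D`, `mixed_tower_conv`, ...) and the `inception_*` keys listed above follow two different naming schemes, and `new_fc.weight` expects 2048 input features in the model but has 1024 in the checkpoint. That may mean the model was constructed with a different backbone than the one the jester checkpoint was trained with, though the log alone does not confirm it. The helper below (`diff_state_dicts` is a hypothetical name, not part of the repo) compares parameter names and shapes on plain dictionaries. The toy shapes for the conv layers are made up for illustration; only the `new_fc.weight` sizes come from the error message.

```python
from typing import Dict, List, Tuple

Shape = Tuple[int, ...]


def diff_state_dicts(
    model_shapes: Dict[str, Shape], ckpt_shapes: Dict[str, Shape]
) -> Tuple[List[str], List[str], Dict[str, Tuple[Shape, Shape]]]:
    """Return (missing, unexpected, mismatched) between a model and a checkpoint.

    missing:    names the model expects but the checkpoint lacks
    unexpected: names the checkpoint has but the model lacks
    mismatched: shared names whose shapes differ, as (model_shape, ckpt_shape)
    """
    missing = sorted(k for k in model_shapes if k not in ckpt_shapes)
    unexpected = sorted(k for k in ckpt_shapes if k not in model_shapes)
    mismatched = {
        k: (model_shapes[k], ckpt_shapes[k])
        for k in model_shapes
        if k in ckpt_shapes and model_shapes[k] != ckpt_shapes[k]
    }
    return missing, unexpected, mismatched


# With PyTorch, the two mappings could be built from the objects in the
# traceback (assuming base_dict maps names to tensors):
#   model_shapes = {k: tuple(v.shape) for k, v in net.state_dict().items()}
#   ckpt_shapes = {k: tuple(v.shape) for k, v in base_dict.items()}

# Toy stand-in data; conv shapes are illustrative, not read from the checkpoint.
model_shapes = {
    "base_model.conv_Conv2D.weight": (32, 3, 3, 3),
    "new_fc.weight": (256, 2048),
}
ckpt_shapes = {
    "base_model.inception_3b_1x1.weight": (64, 256, 1, 1),
    "new_fc.weight": (256, 1024),
}

missing, unexpected, mismatched = diff_state_dicts(model_shapes, ckpt_shapes)
print("missing:", missing)
print("unexpected:", unexpected)
print("mismatched:", mismatched)
```

If almost every backbone key ends up in `missing` or `unexpected`, the architecture argument passed when building the model is probably the first thing to check against the one used for training the checkpoint.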