video-classification icon indicating copy to clipboard operation
video-classification copied to clipboard

images used in train

Open 123liluky opened this issue 5 years ago • 2 comments

In UCF101_ResNetCRNN.py: begin_frame, end_frame, skip_frame = 1, 29, 1 selected_frames = np.arange(begin_frame, end_frame, skip_frame).tolist() train_set, valid_set = Dataset_CRNN(data_path, train_list, train_label, selected_frames, transform=transform), Dataset_CRNN(data_path, test_list, test_label, selected_frames, transform=transform)

So, you just use the first 28 images in a video folder to train model? The left images are not used. Am I right?

123liluky avatar Mar 01 '20 09:03 123liluky

yes, in order to have fixed size tensors as inputs for CNNs.

HHTseng avatar Mar 01 '20 20:03 HHTseng

hi Tseng thanks for your repo. X,y-> What is the size of the X here .. as per your conv1 structure in EncoderCNN, the no of dim of input should be 4 but you are passing 5 dim. dint understand this

for t in range(x_3d.size(1)):
            # CNNs
            x = self.conv1(x_3d[:, t, :, :, :])
            x = self.conv2(x)
            x = self.conv3(x)
            x = self.conv4(x)
            x = x.view(x.size(0), -1)    

jaideep11061982 avatar Mar 11 '20 11:03 jaideep11061982