VideoTransformer-pytorch icon indicating copy to clipboard operation
VideoTransformer-pytorch copied to clipboard

How to dataloader?

Open SuperGentry opened this issue 2 years ago • 2 comments

Hello, thank you very much for your outstanding work. I was new to computer vision, and I didn't see how the images were loaded into the model. Could you tell me how to extract 16 frames from the video and input them into the VIVIT model? Looking forward to your reply

SuperGentry avatar Sep 07 '22 08:09 SuperGentry

@SuperGentry We use decord to extract the video frames. And the details about how to read the frames from the video, you can check its official site https://github.com/dmlc/decord. After loading the video frames, the PyTorch use dataloader to organize the data for training, you can check the document from the Pytorch https://pytorch.org/docs/stable/data.html?highlight=dataloader#.

mx-mark avatar Sep 21 '22 10:09 mx-mark

thank you very much!

SuperGentry avatar Sep 21 '22 10:09 SuperGentry