VideoTransformer-pytorch
PyTorch implementation of a collection of scalable Video Transformer benchmarks.
When I try to fine-tune on my dataset starting from the pretrained Kinetics ViViT model, errors occur. I am new to PyTorch; how can I solve the following errors?...
What are the detailed steps and best practices for using the ViViT model to effectively extract video features for various video analysis tasks? I would greatly appreciate any guidance or...
@mx-mark Is it possible to fine-tune the ViViT model on my own video dataset with a different set of classes? Also, what is the procedure for creating a new dataset?
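A common way to fine-tune on a different set of classes is to keep the pretrained backbone and swap only the final classification layer. The sketch below is a minimal illustration of that idea, not this repository's API: `TinyVideoClassifier`, its `backbone` and `head` attributes, and `replace_head` are hypothetical names, and the real model may expose its classifier under a different attribute.

```python
import torch
import torch.nn as nn


class TinyVideoClassifier(nn.Module):
    """Hypothetical stand-in for a pretrained video model with a 400-class head."""

    def __init__(self, in_features: int = 3 * 4 * 8 * 8, embed_dim: int = 32,
                 num_classes: int = 400):
        super().__init__()
        # Toy backbone: flatten a (B, C, T, H, W) clip and project it to an embedding.
        self.backbone = nn.Sequential(
            nn.Flatten(start_dim=1),
            nn.Linear(in_features, embed_dim),
            nn.GELU(),
        )
        self.head = nn.Linear(embed_dim, num_classes)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        return self.head(self.backbone(x))


def replace_head(model: nn.Module, num_classes: int, freeze_backbone: bool = True) -> nn.Module:
    """Swap the classifier for a freshly initialized one sized for the new dataset."""
    if freeze_backbone:
        # Assumption: freezing is optional; it is often used when the new dataset is small.
        for param in model.backbone.parameters():
            param.requires_grad = False
    model.head = nn.Linear(model.head.in_features, num_classes)
    return model


model = replace_head(TinyVideoClassifier(), num_classes=10)
clip = torch.randn(2, 3, 4, 8, 8)  # (batch, channels, frames, height, width)
logits = model(clip)               # one score per new class for each clip
```

Only the new head (and the backbone, if left unfrozen) receives gradient updates, so the optimizer can be built from the parameters with `requires_grad=True`.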
Easy to understand and beginner-friendly. Thank you very much!
Predictions and targets are expected to have the same shape, but got torch.Size([8, 400]) and torch.Size([8]).
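The shapes in this message suggest that the model returns one score per class (`[8, 400]`, matching the 400 Kinetics-400 classes) while the labels are class indices (`[8]`). The following is a minimal sketch of how the two can be reconciled in plain PyTorch; the tensors are random placeholders, and which option applies depends on what the failing metric or loss expects, which the truncated report does not show.

```python
import torch
import torch.nn.functional as F

torch.manual_seed(0)

logits = torch.randn(8, 400)             # model output: (batch, num_classes)
targets = torch.randint(0, 400, (8,))    # labels as class indices: (batch,)

# Cross-entropy accepts (batch, num_classes) logits with (batch,) index targets.
loss = F.cross_entropy(logits, targets)

# Option 1: reduce the logits to predicted class indices so both are (batch,).
preds = logits.argmax(dim=1)
accuracy = (preds == targets).float().mean()

# Option 2: expand the targets to one-hot so both are (batch, num_classes).
one_hot_targets = F.one_hot(targets, num_classes=logits.shape[1])
```

If the custom dataset has a different number of classes than 400, the classifier's output size also needs to match that number before either option is meaningful.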
Hi, great work! I was wondering how to modify the implemented code for a regression task. Furthermore, can we use a custom dataloader for the datasets, as the dataset I...
Hi @mx-mark, do you have a log file for the ViViT fine-tuning experiment with ImageNet-21k pre-trained weights? I am referring to the following experiment: `` python model_pretrain.py -lr 0.005 -epoch 30...
Hello, thanks for your work. I have a simple question. I downloaded pretrained weights (ViT) from the Google Research GitHub, and I just want to know how I can recognize my...