TubeViT
TubeViT copied to clipboard
An unofficial implementation of TubeViT in "Rethinking Video ViTs: Sparse Video Tubes for Joint Image and Video Learning"
The number of tokens in the paper is 559 tokens (ch4.1), but the number of tokens in my implementation is 539. > - 8 x 8 x 8 with a...
Is it possible to publicize the download link of the pre-training model? Thank you very much!
import torch import torchvision.models as models import sys import os current_dir = os.path.dirname(os.path.abspath(__file__)) parent_dir = os.path.dirname(current_dir) sys.path.insert(0, parent_dir) from tubevit.model import TubeViTLightningModule # 加载一个预训练的PyTorch模型 model = TubeViTLightningModule( num_classes=3, video_shape=[3, 1,...