TubeViT icon indicating copy to clipboard operation
TubeViT copied to clipboard

An unofficial implementation of TubeViT in "Rethinking Video ViTs: Sparse Video Tubes for Joint Image and Video Learning"

Results 3 TubeViT issues
Sort by recently updated
recently updated
newest added

The number of tokens in the paper is 559 tokens (ch4.1), but the number of tokens in my implementation is 539. > - 8 x 8 x 8 with a...

implementation details

Is it possible to publicize the download link of the pre-training model? Thank you very much!

import torch import torchvision.models as models import sys import os current_dir = os.path.dirname(os.path.abspath(__file__)) parent_dir = os.path.dirname(current_dir) sys.path.insert(0, parent_dir) from tubevit.model import TubeViTLightningModule # 加载一个预训练的PyTorch模型 model = TubeViTLightningModule( num_classes=3, video_shape=[3, 1,...