VideoTransformer-pytorch
VideoTransformer-pytorch copied to clipboard
structure of ViViT-b
What is the structure of model ViViT-b you published? I can't read it with the default parameters
@nullhty There are two parts of model structure, the first one is a spatial-only transformer and the last one is a temporal-only transformer.