flexivit icon indicating copy to clipboard operation
flexivit copied to clipboard

Pretrained weights

Open FBehrad opened this issue 1 year ago • 1 comments

Hello,

Thank you for preparing the code. I used the following code to create FlexiViT and then I fine-tuned it on my task. As the results are not good I was wondering whether I need to pretrain the network or if the code is already using pre trained models.

from flexivit_pytorch import (flexivit_base, flexivit_huge, flexivit_large,
                              flexivit_small, flexivit_tiny)

net = flexivit_tiny()
net = flexivit_small()
net = flexivit_base()
net = flexivit_large()
net = flexivit_huge()

FBehrad avatar Sep 08 '24 14:09 FBehrad

flexivit_tiny() and the other functions initialize the model with random weights, so yes, you should pretrain before fine-tuning on a small dataset.

bwconrad avatar Sep 08 '24 21:09 bwconrad