AliceMind
AliceMind copied to clipboard
how to get the pre-trained model "ViT-L-14.tar"
I download the pre-trained model "ViT-L-14.pt"x and its feature is 768. However, the vision_width in yaml file is set 1024. This is different.