mage
Cannot load the fine-tuned ckpt
Unexpected key(s) in state_dict: "head.weight", "head.bias", "fc_norm.weight", "fc_norm.bias".
size mismatch for pos_embed: copying a param with shape torch.Size([1, 197, 1024]) from checkpoint, the shape in current model is torch.Size([1, 257, 1024]).
import torch

# ckpt_path points to the downloaded fine-tuned checkpoint
checkpoint = torch.load(ckpt_path, map_location='cpu')
model.load_state_dict(checkpoint['model'])
I am loading mage-vitl-ft.pth, but it doesn't work. Do we need conversion scripts?
What is the model you are using? It seems the initialized model does not contain fc_norm and head. The correct model to use for fine-tuned models should be vit_large_patch16 from models_vit_mage.py, instead of vit_large_patch16 from models_mage.py.
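The two shapes in the size-mismatch error correspond to two different token grids, which is why the checkpoints are not interchangeable. A minimal sketch of the arithmetic (the grid sizes are inferred from the error message, not taken from the repo; the loading snippet in the comments uses the module and function names mentioned above):

```python
# Hedged sketch: why pos_embed is [1, 197, 1024] in the fine-tuned
# checkpoint but [1, 257, 1024] in the generative model.

def pos_embed_len(grid_side: int, extra_tokens: int = 1) -> int:
    """Positional-embedding entries: grid tokens plus one class/fake token."""
    return grid_side * grid_side + extra_tokens

# Fine-tuned classifier (models_vit_mage.py): consistent with a 14x14 grid
assert pos_embed_len(14) == 197
# Generative model (models_mage.py): consistent with a 16x16 grid
assert pos_embed_len(16) == 257

# So the fine-tuned checkpoint must be loaded into the classifier model,
# e.g. (assumed usage, following the answer above):
#
#   import torch
#   import models_vit_mage
#   model = models_vit_mage.vit_large_patch16()
#   checkpoint = torch.load('mage-vitl-ft.pth', map_location='cpu')
#   model.load_state_dict(checkpoint['model'])
```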
Hello, I have encountered the same issue. In main_finetune.py the model is imported from models_vit_mage.py under the names vit_base_patch16 and vit_large_patch16, while in gen_img_uncond.py it is imported from models_mage.py under the names mage_vit_base_patch16 and mage_vit_large_patch16. Using a model trained with main_finetune.py in gen_img_uncond.py raises this error. I am not sure whether it is enough to simply change `model = models_mage.__dict__[args.model]...` to `model = models_vit_mage.__dict__[args.model]...`, because their parameters are different. How can I modify the code to use the model I trained myself for generation?
@rememberBr A fine-tuned model is fine-tuned for classification and cannot be used for generation.
OK, thanks.