VLCounter issues

Hi, I'm running into a weight structure mismatch when I run the test.py

11

if args.EVALUATION.ckpt_used is not None: filepath = os.path.join(root_model, f'{args.EVALUATION.ckpt_used}.pth') assert os.path.isfile(filepath), filepath print("=> loading model weight '{}'".format(filepath),flush=True) checkpoint = torch.load(filepath) model.load_state_dict(checkpoint['state_dict']) print("=> loaded model weight '{}'".format(filepath),flush=True) ![error](https://github.com/Seunggu0305/VLCounter/assets/80736560/8e6ed881-d642-4b2d-be3b-65fc4e5a72c1)

Minsky520

About the strategy of CLIP

1

https://github.com/Seunggu0305/VLCounter/blob/2dc15ddd218744c2c3c63b667fa0bc4a24ce8c3c/tools/models/ViT_Encoder_add.py#L122-L128 I noticed that the maskCLIP strategy was implemented in the code and the MLP of CLIP layers was removed. Could you provide the results without this strategy? Additionally, would...

nanfangAlan