VLCounter
[AAAI 2024] VLCounter: Text-aware Visual Representation for Zero-Shot Object Counting
if args.EVALUATION.ckpt_used is not None:
    filepath = os.path.join(root_model, f'{args.EVALUATION.ckpt_used}.pth')
    assert os.path.isfile(filepath), filepath
    print("=> loading model weight '{}'".format(filepath), flush=True)
    checkpoint = torch.load(filepath)
    model.load_state_dict(checkpoint['state_dict'])
    print("=> loaded model weight '{}'".format(filepath), flush=True)
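For anyone hitting a failure at this step, the snippet above can be split into a path check and a load call, which makes a missing file easier to diagnose. This is only a minimal sketch, not the repository's code: the helper names `resolve_checkpoint` and `load_weights` are hypothetical, and it assumes the checkpoint is a dict with a `state_dict` key, as the snippet suggests. The `map_location` argument is an addition that lets weights saved on a GPU be loaded on a CPU-only machine.

```python
import os


def resolve_checkpoint(root_model, ckpt_used):
    """Return '<root_model>/<ckpt_used>.pth', or None if no checkpoint is requested.

    Hypothetical helper: raises FileNotFoundError instead of using a bare assert,
    so the check still runs when Python is started with -O.
    """
    if ckpt_used is None:
        return None
    filepath = os.path.join(root_model, f"{ckpt_used}.pth")
    if not os.path.isfile(filepath):
        raise FileNotFoundError(f"checkpoint not found: {filepath}")
    return filepath


def load_weights(model, filepath, device="cpu"):
    """Load checkpoint['state_dict'] into model (assumes that key exists)."""
    import torch  # imported here so the path check above works without PyTorch

    print(f"=> loading model weight '{filepath}'", flush=True)
    checkpoint = torch.load(filepath, map_location=device)
    model.load_state_dict(checkpoint["state_dict"])
    print(f"=> loaded model weight '{filepath}'", flush=True)
    return model
```

A call would then look like `path = resolve_checkpoint(root_model, args.EVALUATION.ckpt_used)`, followed by `load_weights(model, path)` when `path` is not `None`.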
https://github.com/Seunggu0305/VLCounter/blob/2dc15ddd218744c2c3c63b667fa0bc4a24ce8c3c/tools/models/ViT_Encoder_add.py#L122-L128
I noticed that the maskCLIP strategy is implemented in the code and that the MLP of the CLIP layers has been removed. Could you provide the results without this strategy? Additionally, would...
This work is great! I have just trained on the FSC dataset, and I would like to know how to train on the Fish dataset. Thanks.