recognize-anything
recognize-anything copied to clipboard
why the tag number is 40898
for ram++ the label_embed size is [2085798, 512]. since each tag has 51 descriptions, so the tags number is 40898 = 5085798/51. but in the configs of finetune.yaml for ram++ is 4585 tags. why the real tag number in model is 40898 not 4585
The label embedding of RAM++ is [233835,512], in here https://huggingface.co/xinyu1205/recognize-anything-plus-model/blob/main/ram_plus_tag_embedding_class_4585_des_51.pth