[FEATURE] Add weights of Microsoft's BiomedCLIP

Open NightMachinery opened this issue 2 years ago • 0 comments

https://huggingface.co/microsoft/BiomedCLIP-PubMedBERT_256-vit_base_patch16_224

BiomedCLIP is a biomedical vision-language foundation model that is pretrained on PMC-15M, a dataset of 15 million figure-caption pairs extracted from biomedical research articles in PubMed Central, using contrastive learning. It uses PubMedBERT as the text encoder and Vision Transformer as the image encoder, with domain-specific adaptations.

Sep 22 '23 08:09 NightMachinery