flex-dm icon indicating copy to clipboard operation
flex-dm copied to clipboard

Embedding dimensions

Open Ashwin-Pokharel opened this issue 1 year ago • 1 comments

Hello , I was a bit confused as the supplementary material and paper describes that image and text features are extracted in 768 dimension using CLIP , however looking at the code the embeddings are described as having 512 dimensional shape. Is there something I'm missing or is there a way you are downscaling from 768 to 512 dimension

Ashwin-Pokharel avatar Sep 17 '23 20:09 Ashwin-Pokharel