ImageBind
Simply replacing Detic's CLIP-based ‘class’ embedding with the ImageBind audio embedding
Thanks for the great work! I tried this, but the ImageBind audio embedding has 1024 dimensions, while the Detic model expects a 512-dimensional embedding. Could you release a matching model, for example imagebind_base.pth?
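To make the mismatch concrete, here is a minimal sketch in plain Python. It assumes the detector scores each class as a dot product between a region feature and the class embedding. The `score` helper and the random vectors are hypothetical stand-ins, not code from Detic or ImageBind. Only the two dimensions (512 and 1024) come from the comment above.

```python
import random

DETIC_DIM = 512       # embedding size the Detic model expects (from the comment above)
IMAGEBIND_DIM = 1024  # ImageBind audio embedding size (from the comment above)


def score(region_feature, class_embedding):
    """Hypothetical helper: dot-product score of a region feature against one class embedding."""
    if len(region_feature) != len(class_embedding):
        raise ValueError(
            f"dimension mismatch: region feature has {len(region_feature)} dims, "
            f"class embedding has {len(class_embedding)}"
        )
    return sum(r * c for r, c in zip(region_feature, class_embedding))


rng = random.Random(0)
# Random stand-ins; real features would come from the respective models.
region_feature = [rng.gauss(0, 1) for _ in range(DETIC_DIM)]
clip_embedding = [rng.gauss(0, 1) for _ in range(DETIC_DIM)]
audio_embedding = [rng.gauss(0, 1) for _ in range(IMAGEBIND_DIM)]

# Works: both vectors are 512-dimensional.
clip_score = score(region_feature, clip_embedding)

# Fails: a 1024-dim audio embedding cannot be scored against 512-dim features.
try:
    score(region_feature, audio_embedding)
    mismatch = False
except ValueError as err:
    mismatch = True
    print(err)
```

Under this assumption, a direct swap only works if the audio embedding and the detector's features share the same dimension, which is why a matching checkpoint would be needed.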