ggml icon indicating copy to clipboard operation
ggml copied to clipboard

add example for ImageBind by facebook

Open Green-Sky opened this issue 1 year ago • 0 comments

ImageBind: One Embedding Space To Bind Them All

ImageBind learns a joint embedding across six different modalities - images, text, audio, depth, thermal, and IMU data. It enables novel emergent applications ‘out-of-the-box’ including cross-modal retrieval, composing modalities with arithmetic, cross-modal detection and generation.

https://github.com/facebookresearch/ImageBind

they provide a pretrained checkpoint (4.5GiB)

Green-Sky avatar May 10 '23 18:05 Green-Sky