multimodal topic
CoCa-pytorch
Implementation of CoCa, Contrastive Captioners are Image-Text Foundation Models, in Pytorch
mmf
A modular framework for vision & language multimodal research from Facebook AI Research (FAIR)
erlexec
Represent, send, store and search multimodal data
docarray
Represent, send, store and search multimodal data
CLIP4Clip
An official implementation for "CLIP4Clip: An Empirical Study of CLIP for End to End Video Clip Retrieval"
DALLE-mtf
Open-AI's DALL-E for large scale training in mesh-tensorflow.
clip-guided-diffusion
A CLI tool/python module for generating images from text using guided diffusion and CLIP from OpenAI.
discoart
🪩 Create Disco Diffusion artworks in one line
mmt
Multi-Modal Transformer for Video Retrieval
PathomicFusion
Fusing Histology and Genomics via Deep Learning - IEEE TMI
OMML
Multi-Modal learning toolkit based on PaddlePaddle and PyTorch, supporting multiple applications such as multi-modal classification, cross-modal retrieval and image caption.