multimodal topic

List multimodal repositories

CoCa-pytorch

990
Stars
88
Forks
Watchers

Implementation of CoCa, Contrastive Captioners are Image-Text Foundation Models, in Pytorch

mmf

5.4k
Stars
922
Forks
Watchers

A modular framework for vision & language multimodal research from Facebook AI Research (FAIR)

erlexec

2.8k
Stars
222
Forks
Watchers

Represent, send, store and search multimodal data

docarray

2.8k
Stars
222
Forks
Watchers

Represent, send, store and search multimodal data

CLIP4Clip

792
Stars
116
Forks
Watchers

An official implementation for "CLIP4Clip: An Empirical Study of CLIP for End to End Video Clip Retrieval"

DALLE-mtf

435
Stars
48
Forks
Watchers

Open-AI's DALL-E for large scale training in mesh-tensorflow.

clip-guided-diffusion

448
Stars
62
Forks
Watchers

A CLI tool/python module for generating images from text using guided diffusion and CLIP from OpenAI.

discoart

3.8k
Stars
246
Forks
Watchers

🪩 Create Disco Diffusion artworks in one line

mmt

249
Stars
40
Forks
Watchers

Multi-Modal Transformer for Video Retrieval

PathomicFusion

257
Stars
77
Forks
Watchers

Fusing Histology and Genomics via Deep Learning - IEEE TMI

OMML

555
Stars
98
Forks
Watchers

Multi-Modal learning toolkit based on PaddlePaddle and PyTorch, supporting multiple applications such as multi-modal classification, cross-modal retrieval and image caption.