multimodal topic

List multimodal repositories

Awesome-Multimodal-Research

1.3k
Stars
150
Forks
Watchers

A curated list of Multimodal Related Research.

MNMT

12
Stars
0
Forks
Watchers

Pytorch implementation of Multimodal Neural Machine Translation(MNMT).

MAGIC

251
Stars
27
Forks
Watchers

Language Models Can See: Plugging Visual Controls in Text Generation

DeepGCCA-pytorch

47
Stars
13
Forks
Watchers

An implementation of Deep Generalized Canonical Correlation Analysis (DGCCA or Deep GCCA) with pytorch.

UVR-NMT

87
Stars
21
Forks
Watchers

Neural Machine Translation with universal Visual Representation (ICLR 2020)

Taris

25
Stars
6
Forks
Watchers

Transformer-based online speech recognition system with TensorFlow 2

clip-retrieval

2.2k
Stars
198
Forks
Watchers

Easily compute clip embeddings and build a clip retrieval system with them

wit

959
Stars
39
Forks
Watchers

WIT (Wikipedia-based Image Text) Dataset is a large multimodal multilingual dataset comprising 37M+ image-text sets with 11M+ unique images across 100+ languages.

jina

20.2k
Stars
2.2k
Forks
179
Watchers

☁️ Build multimodal AI applications with cloud-native stack

OFA

2.3k
Stars
245
Forks
Watchers

Official repository of OFA (ICML 2022). Paper: OFA: Unifying Architectures, Tasks, and Modalities Through a Simple Sequence-to-Sequence Learning Framework