multimodal topic
Awesome-Multimodal-Research
A curated list of Multimodal Related Research.
MNMT
Pytorch implementation of Multimodal Neural Machine Translation(MNMT).
MAGIC
Language Models Can See: Plugging Visual Controls in Text Generation
DeepGCCA-pytorch
An implementation of Deep Generalized Canonical Correlation Analysis (DGCCA or Deep GCCA) with pytorch.
UVR-NMT
Neural Machine Translation with universal Visual Representation (ICLR 2020)
Taris
Transformer-based online speech recognition system with TensorFlow 2
clip-retrieval
Easily compute clip embeddings and build a clip retrieval system with them
wit
WIT (Wikipedia-based Image Text) Dataset is a large multimodal multilingual dataset comprising 37M+ image-text sets with 11M+ unique images across 100+ languages.
jina
☁️ Build multimodal AI applications with cloud-native stack
OFA
Official repository of OFA (ICML 2022). Paper: OFA: Unifying Architectures, Tasks, and Modalities Through a Simple Sequence-to-Sequence Learning Framework