multi-modal topic
OASIS
Official implementation of the paper "You Only Need Adversarial Supervision for Semantic Image Synthesis" (ICLR 2021)
TSIT
[ECCV 2020 Spotlight] A Simple and Versatile Framework for Image-to-Image Translation
Caesar.jl
Robust robotic localization and mapping, together with NavAbility(TM). Reach out to [email protected] for help.
Multi-Modal-Transformer
The repository collects many various multi-modal transformer architectures, including image transformer, video transformer, image-language transformer, video-language transformer and self-supervised l...
DevKit
A Javascript library for developing modules for COBI.Bike – the perfect fusion of smartphone and bike.
IncrementalInference.jl
Clique recycling non-Gaussian (multi-modal) factor graph solver; also see Caesar.jl.
iPerceive
Applying Common-Sense Reasoning to Multi-Modal Dense Video Captioning and Video Question Answering | Python3 | PyTorch | CNNs | Causality | Reasoning | LSTMs | Transformers | Multi-Head Self Attention...
Image-Text-Papers
Image Caption and Text to Image papers.
Audio2Head
code for paper "Audio2Head: Audio-driven One-shot Talking-head Generation with Natural Head Motion" in the conference of IJCAI 2021
HiNet
Code for TMI 2020 "Hi-Net: Hybrid-fusion Network for Multi-modal MR Image Synthesis"