multimodal-deep-learning topic
referit3d
Code accompanying our ECCV-2020 paper on 3D Neural Listeners.
awesome-Vision-and-Language-Pre-training
Recent Advances in Vision and Language Pre-training (VLP)
embracenet
Robust multimodal integration method implemented in PyTorch and TensorFlow
muscaps
Source code for "MusCaps: Generating Captions for Music Audio" (IJCNN 2021)
DJ-RN
Part of the HAKE project (HAKE-3D). Code for our CVPR 2020 paper "Detailed 2D-3D Joint Representation for Human-Object Interaction".
Multimodal-Infomax
Official implementation of the paper "Improving Multimodal Fusion with Hierarchical Mutual Information Maximization for Multimodal Sentiment Analysis", accepted at EMNL...
Awesome-Multimodality
A survey of multimodal learning research.
Multimodal-action-recognition
Code for selecting an action based on multimodal inputs; here, the inputs are voice and text.
LAVIS
LAVIS - A One-stop Library for Language-Vision Intelligence
MultiGraphGAN
MultiGraphGAN for predicting multiple target graphs from a source graph using geometric deep learning.