multimodal-deep-learning topic

List multimodal-deep-learning repositories

Awesome-3D-Vision-and-Language

100
Stars
5
Forks
100
Watchers

A collection of 3D vision and language (e.g., 3D Visual Grounding, 3D Question Answering and 3D Dense Caption) papers and datasets.

attentive-modality-hopping-for-SER

31
Stars
9
Forks
Watchers

TensorFlow implementation of "Attentive Modality Hopping for Speech Emotion Recognition," ICASSP-20

Multimodal-Learning

27
Stars
3
Forks
Watchers

This repository contains the source code for the paper "Improving the performance of unimodal dynamic hand gesture recognition with multimodal training"

mmae

18
Stars
11
Forks
Watchers

Package for Multimodal Autoencoders in TensorFlow / Keras

DeepCU-IJCAI19

19
Stars
8
Forks
Watchers

DeepCU: Integrating Both Common and Unique Latent Information for Multimodal Sentiment Analysis, IJCAI-19

SUTD-TrafficQA

49
Stars
2
Forks
Watchers

[CVPR2021] SUTD-TrafficQA: A Question Answering Benchmark and an Efficient Network for Video Reasoning over Traffic Events

hateful_memes-hate_detectron

53
Stars
19
Forks
Watchers

Detecting Hate Speech in Memes Using Multimodal Deep Learning Approaches: Prize-winning solution to Hateful Memes Challenge. https://arxiv.org/abs/2012.12975

visual-spatial-reasoning

87
Stars
7
Forks
Watchers

[TACL'23] VSR: A probing benchmark for spatial undersranding of vision-language models.