multimodal-representation topic
FVTA_MemexQA
Real-world photo sequence question answering system (MemexQA). CVPR'18 and TPAMI'19
cornac
A Comparative Framework for Multimodal Recommender Systems
BERT-like-is-All-You-Need
The code for our INTERSPEECH 2020 paper - Jointly Fine-Tuning "BERT-like'" Self Supervised Models to Improve Multimodal Speech Emotion Recognition
BBFN
This repository contains the implementation of the paper -- Bi-Bimodal Modality Fusion for Correlation-Controlled Multimodal Sentiment Analysis
mica-deep-mcca
Deep Multiset Canonical Correlation Analysis - An extension of CCA to multiple datasets
awesome-open-papernotes
Yet another Ph.D. adventure.
pyWikiMM
Collects a multimodal dataset of Wikipedia articles and their images
IISAN
IISAN: Efficiently Adapting Multimodal Representation for Sequential Recommendation with Decoupled PEFT