multimodal-learning topic

List multimodal-learning repositories

Awsome-Deep-Learning-for-Video-Analysis

724
Stars
168
Forks
Watchers

Papers, code and datasets about deep learning and multi-modal learning for video analysis

OMML

555
Stars
98
Forks
Watchers

Multi-Modal learning toolkit based on PaddlePaddle and PyTorch, supporting multiple applications such as multi-modal classification, cross-modal retrieval and image caption.

Multimodal-Toolkit

557
Stars
83
Forks
Watchers

Multimodal model for text and tabular data with HuggingFace transformers as building block for text data

multimodal-deep-learning

662
Stars
141
Forks
Watchers

This repository contains various models targetting multimodal representation learning, multimodal fusion for downstream tasks such as multimodal sentiment analysis.

MultiBench

441
Stars
64
Forks
Watchers

[NeurIPS 2021] Multiscale Benchmarks for Multimodal Representation Learning

mvits_for_class_agnostic_od

296
Stars
24
Forks
Watchers

[ECCV'22] Official repository of paper titled "Class-agnostic Object Detection with Multi-modal Transformer".

multimodal-vae-public

149
Stars
37
Forks
Watchers

A PyTorch implementation of "Multimodal Generative Models for Scalable Weakly-Supervised Learning" (https://arxiv.org/abs/1802.05335)

MMVID

189
Stars
21
Forks
Watchers

[CVPR 2022] Show Me What and Tell Me How: Video Synthesis via Multimodal Conditioning

vista-net

89
Stars
19
Forks
Watchers

Code for the paper "VistaNet: Visual Aspect Attention Network for Multimodal Sentiment Analysis", AAAI'19