multimodal-deep-learning topic

ECCV2022-Papers-with-Code-Demo

Stars: 285, Forks: 23

A collection of the latest ECCV work, including papers, code, and demo videos. Recommendations are welcome!

DeepViewAgg

Stars: 216, Forks: 23

[CVPR'22 Best Paper Finalist] Official PyTorch implementation of the method presented in "Learning Multi-View Aggregation In the Wild for Large-Scale 3D Semantic Segmentation"

pytorch-widedeep

Stars: 1.3k, Forks: 185

A flexible package for multimodal deep learning that combines tabular data with text and images using Wide and Deep models in PyTorch

Generative-AI

Stars: 778, Forks: 59

[TPAMI 2023] Multimodal Image Synthesis and Editing: The Generative AI Era

Video-to-Retail-Platform

Stars: 139, Forks: 43

An intelligent multimodal-learning based system for video, product and ads analysis. Based on the system, people can build a lot of downstream applications such as product recommendation, video retrie...

multimodal-speech-emotion

Stars: 246, Forks: 69

TensorFlow implementation of "Multimodal Speech Emotion Recognition using Audio and Text," IEEE SLT-18

Pseudo-Q

Stars: 139, Forks: 9

[CVPR 2022] Pseudo-Q: Generating Pseudo Language Queries for Visual Grounding

The code for our IEEE Access (2020) paper "Multimodal Emotion Recognition with Transformer-Based Self Supervised Feature Fusion"

mmtm

Stars: 107, Forks: 21

Implementation of CVPR 2020 paper "MMTM: Multimodal Transfer Module for CNN Fusion"