multimodal-deep-learning topic
ECCV2022-Papers-with-Code-Demo
A collection of the latest ECCV work, including papers, code, and demo videos. Recommendations are welcome!
DeepViewAgg
[CVPR'22 Best Paper Finalist] Official PyTorch implementation of the method presented in "Learning Multi-View Aggregation In the Wild for Large-Scale 3D Semantic Segmentation"
pytorch-widedeep
A flexible package for multimodal deep learning that combines tabular data with text and images using Wide and Deep models in PyTorch
Generative-AI
[TPAMI 2023] Multimodal Image Synthesis and Editing: The Generative AI Era
Video-to-Retail-Platform
An intelligent multimodal-learning-based system for video, product, and ad analysis. Based on the system, people can build many downstream applications such as product recommendation, video retrie...
multimodal-speech-emotion
TensorFlow implementation of "Multimodal Speech Emotion Recognition using Audio and Text," IEEE SLT-18
MUStARD
Multimodal Sarcasm Detection Dataset
Pseudo-Q
[CVPR 2022] Pseudo-Q: Generating Pseudo Language Queries for Visual Grounding
Self-Supervised-Embedding-Fusion-Transformer
The code for our IEEE Access (2020) paper "Multimodal Emotion Recognition with Transformer-Based Self Supervised Feature Fusion"
mmtm
Implementation of CVPR 2020 paper "MMTM: Multimodal Transfer Module for CNN Fusion"