multi-modal-deep-learning topic

List multi-modal-deep-learning repositories

glami-1m

68
Stars
6
Forks
Watchers

The largest multilingual image-text classification dataset. It contains fashion products.

Husformer

104
Stars
30
Forks
Watchers

This repository contains the source code for our paper: "Husformer: A Multi-Modal Transformer for Multi-Modal Human State Recognition". For more details, please refer to our paper at https://arxiv.org...