multi-modal-learning topic

List multi-modal-learning repositories

HyperDenseNet_pytorch

78
Stars
12
Forks
Watchers

Pytorch version of the HyperDenseNet deep neural network for multi-modal image segmentation

x-clip

660
Stars
46
Forks
Watchers

A concise but complete implementation of CLIP with various experimental improvements from recent papers

open_clip

8.7k
Stars
875
Forks
39
Watchers

An open source implementation of CLIP.

awesome-visual-question-answering

647
Stars
95
Forks
Watchers

A curated list of Visual Question Answering(VQA)(Image/Video Question Answering),Visual Question Generation ,Visual Dialog ,Visual Commonsense Reasoning and related area.

nemar

165
Stars
25
Forks
Watchers

[CVPR2020] Unsupervised Multi-Modal Image Registration via Geometry Preserving Image-to-Image Translation

A curated list of vision-and-language pre-training (VLP). :-)

Code repository for Rakuten Data Challenge: Multimodal Product Classification and Retrieval.

Chinese-CLIP

3.8k
Stars
404
Forks
Watchers

Chinese version of CLIP which achieves Chinese cross-modal retrieval and representation generation.

NeuralMerger

20
Stars
3
Forks
Watchers

Yi-Min Chou, Yi-Ming Chan, Jia-Hong Lee, Chih-Yi Chiu, Chu-Song Chen, "Unifying and Merging Well-trained Deep Neural Networks for Inference Stage," International Joint Conference on Artificial Intelli...

Multimodal-Remote-Sensing-Toolkit

74
Stars
12
Forks
Watchers

A python tool to perform deep learning experiments on multimodal remote sensing data.