multi-modal topic

List multi-modal repositories

MDVC

138
Stars
19
Forks
Watchers

PyTorch implementation of Multi-modal Dense Video Captioning (CVPR 2020 Workshops)

SpecVQGAN

323
Stars
36
Forks
Watchers

Source code for "Taming Visually Guided Sound Generation" (Oral at the BMVC 2021)

awesome-visual-question-answering

647
Stars
95
Forks
Watchers

A curated list of Visual Question Answering(VQA)(Image/Video Question Answering),Visual Question Generation ,Visual Dialog ,Visual Commonsense Reasoning and related area.

DeepKE

3.1k
Stars
644
Forks
Watchers

[EMNLP 2022] An Open Toolkit for Knowledge Graph Extraction and Construction

erlexec

2.8k
Stars
222
Forks
Watchers

Represent, send, store and search multimodal data

docarray

2.8k
Stars
222
Forks
Watchers

Represent, send, store and search multimodal data

valhalla

4.2k
Stars
657
Forks
Watchers

Open Source Routing Engine for OpenStreetMap

DALLE-pytorch

5.5k
Stars
639
Forks
Watchers

Implementation / replication of DALL-E, OpenAI's Text to Image Transformer, in Pytorch

MedMNIST

987
Stars
155
Forks
Watchers

[pip install medmnist] 18x Standardized Datasets for 2D and 3D Biomedical Image Classification

nemar

165
Stars
25
Forks
Watchers

[CVPR2020] Unsupervised Multi-Modal Image Registration via Geometry Preserving Image-to-Image Translation

Transformer-in-Vision

1.3k
Stars
142
Forks
Watchers

Recent Transformer-based CV and related works.