multi-modality topic

clip-as-service

12.2k stars, 2.1k forks

🏄 Scalable embedding, reasoning, ranking for images and sentences with CLIP

bert-as-service

clip-as-service

jina

21.0k stars, 2.2k forks

☁️ Build multimodal AI applications with cloud-native stack

mmMOT

252 stars, 25 forks

[ICCV2019] Robust Multi-Modality Multi-Object Tracking

deep-daze

4.4k stars, 327 forks

Simple command-line tool for text-to-image generation using OpenAI's CLIP and Siren (an implicit neural representation network). The technique was originally created by https://twitter.com/advadnoun

artificial-intelligence

implicit-neural-representation

UVTR

219 stars, 16 forks

Unifying Voxel-based Representation with Transformer for 3D Object Detection (NeurIPS 2022)

CVPR21Chal-SLR

220 stars, 55 forks

This repo contains the official code of our work SAM-SLR, which won the CVPR 2021 Challenge on Large Scale Signer Independent Isolated Sign Language Recognition.

sign-language-recognition

sign-language-recognition-system

CRIS.pytorch

270 stars, 38 forks

An official PyTorch implementation of the CRIS paper

contrastive-learning

referring-image-segmentation

ComposeAE

54 stars, 17 forks

Official code for WACV 2021 paper - Compositional Learning of Image-Text Query for Image Retrieval

compositional-learning

information-retrieval

TRAR-VQA

63 stars, 18 forks

[ICCV 2021] Official implementation of the paper "TRAR: Routing the Attention Spans in Transformers for Visual Question Answering"

dynamic-network

Awesome-Multimodal-Large-Language-Models

11.9k stars, 765 forks

✨✨ Latest Advances on Multimodal Large Language Models

chain-of-thought

in-context-learning

instruction-following

instruction-tuning