multi-modality topic

List of multi-modality repositories

clip-as-service

12.2k
Stars
2.1k
Forks

🏄 Scalable embedding, reasoning, ranking for images and sentences with CLIP

jina

21.0k
Stars
2.2k
Forks
179
Watchers

☁️ Build multimodal AI applications with cloud-native stack

mmMOT

252
Stars
25
Forks

[ICCV2019] Robust Multi-Modality Multi-Object Tracking

deep-daze

4.4k
Stars
327
Forks

Simple command-line tool for text-to-image generation using OpenAI's CLIP and Siren (an implicit neural representation network). The technique was originally created by https://twitter.com/advadnoun

UVTR

219
Stars
16
Forks

Unifying Voxel-based Representation with Transformer for 3D Object Detection (NeurIPS 2022)

CVPR21Chal-SLR

220
Stars
55
Forks

This repo contains the official code of our work SAM-SLR, which won the CVPR 2021 Challenge on Large Scale Signer Independent Isolated Sign Language Recognition.

CRIS.pytorch

270
Stars
38
Forks

An official PyTorch implementation of the CRIS paper

ComposeAE

54
Stars
17
Forks

Official code for WACV 2021 paper - Compositional Learning of Image-Text Query for Image Retrieval

TRAR-VQA

63
Stars
18
Forks

[ICCV 2021] Official implementation of the paper "TRAR: Routing the Attention Spans in Transformers for Visual Question Answering"

Awesome-Multimodal-Large-Language-Models

11.9k
Stars
765
Forks
208
Watchers

✨✨ Latest Advances on Multimodal Large Language Models