vision-language topic

List vision-language repositories

BLIP

4.3k
Stars
573
Forks
Watchers

PyTorch code for BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation

OFA

2.3k
Stars
247
Forks
Watchers

Official repository of OFA (ICML 2022). Paper: OFA: Unifying Architectures, Tasks, and Modalities Through a Simple Sequence-to-Sequence Learning Framework

Kaleido-BERT

264
Stars
19
Forks
Watchers

💐Kaleido-BERT: Vision-Language Pre-training on Fashion Domain

cliport

424
Stars
78
Forks
Watchers

CLIPort: What and Where Pathways for Robotic Manipulation

Vision-Language-Transformer

335
Stars
21
Forks
Watchers

[ICCV2021 & TPAMI2023] Vision-Language Transformer and Query Generation for Referring Segmentation

pix2seq

823
Stars
67
Forks
Watchers

Pix2Seq codebase: multi-tasks with generative modeling (autoregressive and diffusion)

calvin

275
Stars
44
Forks
Watchers

CALVIN - A benchmark for Language-Conditioned Policy Learning for Long-Horizon Robot Manipulation Tasks

vse_infty

149
Stars
18
Forks
Watchers

Code for "Learning the Best Pooling Strategy for Visual Semantic Embedding", CVPR 2021

ContraCLIP

41
Stars
0
Forks
Watchers

Authors official PyTorch implementation of the "ContraCLIP: Interpretable GAN generation driven by pairs of contrasting sentences".