vision-and-language topic

List vision-and-language repositories

LAVIS

9.3k
Stars
921
Forks
Watchers

LAVIS - A One-stop Library for Language-Vision Intelligence

awesome-visual-grounding

28
Stars
2
Forks
Watchers

awesome visual grounding: a curated list of research papers in referring visual grounding

VLDeformer

26
Stars
3
Forks
Watchers

PyTorch implementation of the paper "VLDeformer: Vision Language Decomposed Transformer for Fast Cross-modal Retrieval", KBS 2022

zeroshot-storytelling

15
Stars
0
Forks
Watchers

GitHub repository for Zero Shot Visual Storytelling

NvEM

76
Stars
2
Forks
Watchers

[ACM MM 2021 Oral] Official repo of "Neighbor-view Enhanced Model for Vision and Language Navigation"

visual-spatial-reasoning

87
Stars
7
Forks
Watchers

[TACL'23] VSR: A probing benchmark for spatial understanding of vision-language models.

VLMbench

73
Stars
8
Forks
Watchers

NeurIPS 2022 Paper "VLMbench: A Compositional Benchmark for Vision-and-Language Manipulation"

VL-PLM

83
Stars
8
Forks
Watchers

Exploiting unlabeled data with vision and language models for object detection, ECCV 2022

synse-zsl

29
Stars
4
Forks
Watchers

Official PyTorch code for the ICIP 2021 paper 'Syntactically Guided Generative Embeddings For Zero Shot Skeleton Action Recognition'