vision-language topic

List vision-language repositories

VidIL

Stars: 112 | Forks: 1

PyTorch code for "Language Models with Image Descriptors are Strong Few-Shot Video-Language Learners"

Active_VLN

Stars: 43 | Forks: 7

The repository of ECCV 2020 paper `Active Visual Information Gathering for Vision-Language Navigation`

NExT-QA

Stars: 115 | Forks: 11

NExT-QA: Next Phase of Question-Answering to Explaining Temporal Actions (CVPR'21)

LViT

Stars: 259 | Forks: 24

[IEEE Transactions on Medical Imaging/TMI] This repo is the official implementation of "LViT: Language meets Vision Transformer in Medical Image Segmentation"

VaLM

Stars: 54 | Forks: 3

VaLM: Visually-augmented Language Modeling. ICLR 2023.

rtic-gcn-pytorch

Stars: 20 | Forks: 3

Official PyTorch implementation of RTIC

PKOL

Stars: 44 | Forks: 0

[TIP 2022] Official code of paper “Video Question Answering with Prior Knowledge and Object-sensitive Learning”

S2-Transformer

Stars: 78 | Forks: 4

[IJCAI 2022] Official PyTorch code for paper “S2 Transformer for Image Captioning”

Chinese-CLIP

Stars: 3.8k | Forks: 404

A Chinese version of CLIP that supports Chinese cross-modal retrieval and representation generation.

VLTVG

Stars: 88 | Forks: 7

Improving Visual Grounding with Visual-Linguistic Verification and Iterative Reasoning, CVPR 2022