vision-and-language topics

RaNet

30

Stars

3

Forks

Watchers

Source code for our RaNet (EMNLP 2021)

Huntersxsx

emnlp2021

natural-language-video-localization

ranet

temporal-sentence-grounding

pytorch_sscr

23

Stars

5

Forks

Watchers

A PyTorch implementation of SSCR

tsujuifu

computer-vision

emnlp2020

image-editing

pytorch

HiREST

90

Stars

9

Forks

Watchers

Hierarchical Video-Moment Retrieval and Step-Captioning (CVPR 2023)

j-min

hirest

moment-retrieval

moment-segmentation

step-captioning

FactualSceneGraph

85

Stars

12

Forks

Watchers

The FACTUAL benchmark dataset and the pre-trained textual scene graph parser trained on FACTUAL.

zhuang-li

natural-language-processing

scene-graph

vision-and-language

x-lxmert

50

Stars

10

Forks

Watchers

PyTorch code for EMNLP 2020 paper "X-LXMERT: Paint, Caption and Answer Questions with Multi-Modal Transformers"

allenai

ai2

emnlp2020

image-generation

pretrained-models

PartGlot

30

Stars

4

Forks

Watchers

Official Implementation of PartGlot (CVPR 2022 Oral)

KAIST-Visual-AI-Group

computer-vision

deep-learning

nlp

vision-and-language

lang2seg

30

Stars

8

Forks

Watchers

Referring Expression Object Segmentation with Caption-Aware Consistency, BMVC 2019

wenz116

computer-vision

cycle-consistency

deep-learning

object-detection

TSGV-Learning-List

31

Stars

3

Forks

Watchers

Related work on Temporal Sentence Grounding in Videos / Natural Language Video Localization / Video Moment Retrieval

Huntersxsx

natural-language-video-localization

temporal-sentence-grounding

video-moment-retrieval

vision-and-language

GroundVLP

33

Stars

2

Forks

Watchers

GroundVLP: Harnessing Zero-shot Visual Grounding from Vision-Language Pre-training and Open-Vocabulary Object Detection (AAAI 2024)

om-ai-lab

multimodal

object-detection

vision-and-language

zero-shot-learning

MGPN

18

Stars

1

Forks

Watchers

Source code for our MGPN (SIGIR 2022)

Huntersxsx

mgpn

natural-language-video-localization

sigir2022

temporal-sentence-grounding