Teng Wang

Results 6 repositories owned by Teng Wang

A curated list of prompt-based paper in computer vision and vision-language learning.

PDVC

190
Stars
22
Forks
Watchers

End-to-End Dense Video Captioning with Parallel Decoding (ICCV 2021)

dense-video-captioning-pytorch

72
Stars
23
Forks
Watchers

Second-place solution to dense video captioning task in ActivityNet Challenge (CVPR 2020 workshop)

Caption-Anything

1.7k
Stars
103
Forks
Watchers

Caption-Anything is a versatile tool combining image segmentation, visual captioning, and ChatGPT, generating tailored captions with diverse controls for user preferences. https://huggingface.co/space...

Awesome_Long_Form_Video_Understanding

326
Stars
14
Forks
326
Watchers

Awesome papers & datasets specifically focused on long-term videos.

VLMixer

17
Stars
1
Forks
Watchers

VLMixer: Unpaired Vision-Language Pre-training via Cross-Modal CutMix (ICML 2022)