Teng Wang
Awesome_Prompting_Papers_in_Computer_Vision
A curated list of prompt-based papers in computer vision and vision-language learning.
PDVC
End-to-End Dense Video Captioning with Parallel Decoding (ICCV 2021)
dense-video-captioning-pytorch
Second-place solution to the dense video captioning task in the ActivityNet Challenge (CVPR 2020 workshop)
Caption-Anything
Caption-Anything is a versatile tool combining image segmentation, visual captioning, and ChatGPT, generating tailored captions with diverse controls for user preferences. https://huggingface.co/space...
Awesome_Long_Form_Video_Understanding
Awesome papers & datasets specifically focused on long-term videos.
VLMixer
VLMixer: Unpaired Vision-Language Pre-training via Cross-Modal CutMix (ICML 2022)