FDU
yifang
MH-DETR: Video Moment and Highlight Detection with Cross-modal Transformer
YoucanBaby
VTG-GPT: Tuning-Free Zero-Shot Video Temporal Grounding with GPT