[arXiv22] Disentangled Representation Learning for Text-Video Retrieval
foolwood
The code for the paper "Hybrid Contrastive Quantization for Efficient Cross-View Video Retrieval" (WWW'22).
gimpong