VLMixer
VLMixer copied to clipboard
VLMixer: Unpaired Vision-Language Pre-training via Cross-Modal CutMix (ICML 2022)