Deep-Cross-Modal-Projection-Learning-for-Image-Text-Matching
Pre-processing Data
Could you please provide information about how the data is pre-processed? Do I need to download the raw data, or will the pre-processed data work?