Deep-Cross-Modal-Projection-Learning-for-Image-Text-Matching icon indicating copy to clipboard operation
Deep-Cross-Modal-Projection-Learning-for-Image-Text-Matching copied to clipboard

Deep Cross-Modal Projection Learning for Image-Text Matching

Results 11 Deep-Cross-Modal-Projection-Learning-for-Image-Text-Matching issues
Sort by recently updated
recently updated
newest added

I use python 3.6.4, Pytorch1.0.0 & torchvision 0.2.1, scipy 1.2.1. The results in paper 'Deep Cross-Modal Pojection Learning for Image-Text Matching' on CUHK-PEDES are:{top- 1 = 49.37%,top-10 = 79.27%}, but...

File "/home/dalian/xuxu/Image-Text-Matching/train.py", line 96, in main ema = EMA(args.ema_decay) NameError: name 'EMA' is not defined

Notice that the results in paper 'Deep Cross-Modal Pojection Learning for Image-Text Matching' are:{top- 1 = 49.37%,top-10 = 79.27%} while the results in this project are {top- 1 = 42.999%,top-10...

the argment "checkpoint_dir" should be "model_path"

Could you please give information regarding pre-processing of the data? Is there any requirement of downloading the raw data or pre-processed will work?

Hi, I appreciate your pytorch version of this work. I want to know if this code can reproduce the same performance/results as the paper reports (cmpm:rank1 44.02 cmpm+cmpc:rank1 49.37 on...

Hello. Thank you for your excellent work! But I noticed a problem with CMPC Loss. In the paper, "the vector projection of image feature xi onto normalized text feature ̄...

原论文里面说pi是预测概率分布,qi是真实概率分布。如果是这样的话,那Li应该等于\sum{j=1}{n}qijlog(qij/pij)吧?