Deep-Cross-Modal-Projection-Learning-for-Image-Text-Matching
Deep-Cross-Modal-Projection-Learning-for-Image-Text-Matching copied to clipboard
Question about the code of CMPC loss.
Hello. Thank you for your excellent work! But I noticed a problem with CMPC Loss. In the paper, "the vector projection of image feature xi onto normalized text feature ̄ zi" is not implemented according to the formula. In the code, the calculation is done directly by using the dot product. Is this code inconsistent with the writing of the paper? It is not known whether this has any impact on model performance. Could you provide some explanation? Thanks a lot!