Mouxing Young
> You may try the code in https://github.com/mangye16/Cross-Modal-Re-ID-baseline

Thanks for your reply. I have tried the recommended code with the triplet loss and the proposed WRT loss, respectively. However, the one...
Hi~ I have tried the recommended code with the triplet loss and the proposed WRT loss, respectively. However, the one with the triplet loss outperforms the one with the WRT loss by ~7% and...
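For reference, below is a minimal PyTorch sketch of the weighted regularized triplet (WRT) loss used in that baseline, assuming a standard batch of embeddings with identity labels; the function and variable names here are illustrative, not taken from the repository:

```python
import torch
import torch.nn.functional as F

def weighted_regularized_triplet(embeddings, labels):
    """Sketch of the weighted regularized triplet (WRT) loss: instead of
    hard mining with a fixed margin, positives and negatives are
    softmax-weighted by distance and a soft margin (softplus) is applied."""
    dist = torch.cdist(embeddings, embeddings, p=2)        # (B, B) pairwise distances
    same = labels.unsqueeze(0).eq(labels.unsqueeze(1)).float()
    eye = torch.eye(len(labels), device=labels.device)
    is_pos = same - eye                                    # same identity, excluding self-pairs
    is_neg = 1.0 - same                                    # different identity

    # Harder positives (far away) and harder negatives (close by) get larger weights;
    # non-candidate pairs are masked out with a large negative logit.
    w_pos = F.softmax(dist * is_pos - 1e4 * (1 - is_pos), dim=1)
    w_neg = F.softmax(-dist * is_neg - 1e4 * (1 - is_neg), dim=1)

    pos_term = (w_pos * dist).sum(dim=1)                   # weighted positive distance per anchor
    neg_term = (w_neg * dist).sum(dim=1)                   # weighted negative distance per anchor
    return F.softplus(pos_term - neg_term).mean()          # soft-margin triplet objective
```

With a batch of embeddings and identity labels, this can be swapped against a plain margin-based triplet loss in the same training loop for a direct comparison.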
> Hi~ It's a nice piece of work. The proposed CC loss seems to narrow the modality gap, which is often done with a triplet loss. So, how about replacing...

In my implementation, MPANet with the CC loss outperforms MPANet with the triplet loss by nearly 10% in terms of performance. I really wonder why the CC loss could help the...
Hi, sorry for the late reply, and thanks for your interest. For SAR and EATA, we used the code from [this repository](https://github.com/mr-eggplant/SAR) and modified the input and output parameters without...
Hi, we follow CAV-MAE (Gong et al.) and first extract 10 frames from each video. Then, we add corruptions to the images following the ImageNet-C benchmark. As for the audio, we...
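As a rough illustration of that preprocessing, here is a minimal sketch that samples 10 evenly spaced frames from a video with OpenCV and corrupts them with the `imagecorruptions` package (ImageNet-C style corruptions); the function name and arguments are illustrative, not the authors' actual script:

```python
import cv2
import numpy as np
from imagecorruptions import corrupt  # pip install imagecorruptions

def extract_and_corrupt(video_path, num_frames=10,
                        corruption="gaussian_noise", severity=5):
    """Sample `num_frames` evenly spaced frames from a video and apply an
    ImageNet-C style corruption to each frame (sketch, not the released code)."""
    cap = cv2.VideoCapture(video_path)
    total = int(cap.get(cv2.CAP_PROP_FRAME_COUNT))
    idxs = np.linspace(0, max(total - 1, 0), num_frames).astype(int)

    frames = []
    for i in idxs:
        cap.set(cv2.CAP_PROP_POS_FRAMES, int(i))
        ok, frame = cap.read()
        if not ok:
            continue
        rgb = cv2.cvtColor(frame, cv2.COLOR_BGR2RGB)
        # imagecorruptions expects an RGB uint8 array of shape (H, W, 3)
        frames.append(corrupt(rgb, corruption_name=corruption, severity=severity))
    cap.release()
    return frames
```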
Hi, we just followed the fine-tuning pipeline of [CAV-MAE](https://github.com/YuanGongND/cav-mae/blob/master/egs/vggsound/run_cavmae_ft.sh) on the VGGSound dataset to get cav_mae_ks50.pth. The main modification is that I replaced the label weight file (NOTE: not model...
Hi, sorry for the delayed response. I’ve downloaded the repo, conducted the experiment on VGGSound with Gaussian-5 using the released command, and successfully reproduced the results (~40.3). I noticed that...