audio-text_retrieval icon indicating copy to clipboard operation
audio-text_retrieval copied to clipboard

Implementation of our paper 'On Metric Learning For Audio-Text Cross-Modal Retrieval'

Results 3 audio-text_retrieval issues
Sort by recently updated
recently updated
newest added

I wish to do a reproducibility study of this work.

Hello, we are trying to set up and train the models on Mac M2 10-core GPU with 8GB RAM with MPS framework and also a GTX 1650 Ti with 4GB...

Could you explain the significance of mask in the NT-Xent loss function? ``` mask = labels.expand(n, n).eq(labels.expand(n, n).t()).to(a2t.device) mask_diag = mask.diag() mask_diag = torch.diag_embed(mask_diag) mask = mask ^ mask_diag a2t_loss...