audio-text_retrieval
audio-text_retrieval copied to clipboard
How to to obtain the downstream retrieval performace?
I wish to do a reproducibility study of this work.
any luck?