syncnet_trainer

Evaluation on list save

Open annadodson787 opened this issue 5 years ago • 1 comment

Hi, I am wondering what the reasoning is behind the evaluation implemented in evaluateFromListSave. As far as I can tell, it loads two audio files, runs the audio feature extractor on each, and computes the feature-wise cosine distance between them. Where does the video pipeline come in? How is this a good evaluation metric if it does not use the visual stream?

annadodson787, Oct 27 '20 16:10

This part of the pipeline evaluates the quality of the audio embeddings for the downstream task of speaker recognition, which is why it only uses the audio stream. The audio-visual evaluation can be done using the validation script.

joonson, Oct 28 '20 02:10
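
For readers following the thread, here is a minimal sketch of the kind of audio-only scoring described above: each trial pairs two utterances, and the score is the cosine similarity of their embeddings (cosine distance is 1 minus this value). It is not the repository's code. The names `cosine_similarity` and `score_trials`, the trial format `(label, utt_a, utt_b)`, and the toy embeddings are all assumptions made for illustration; in practice the embeddings would come from the audio feature extractor.

```python
import numpy as np


def cosine_similarity(a, b):
    """Cosine similarity between two 1-D embedding vectors."""
    a = np.asarray(a, dtype=np.float64)
    b = np.asarray(b, dtype=np.float64)
    return float(np.dot(a, b) / (np.linalg.norm(a) * np.linalg.norm(b)))


def score_trials(trials, embeddings):
    """Score each (label, utt_a, utt_b) trial by cosine similarity.

    `trials` and `embeddings` are hypothetical structures for this sketch:
    label is 1 for a same-speaker pair and 0 for a different-speaker pair,
    and `embeddings` maps an utterance name to its embedding vector.
    """
    labels, scores = [], []
    for label, utt_a, utt_b in trials:
        labels.append(int(label))
        scores.append(cosine_similarity(embeddings[utt_a], embeddings[utt_b]))
    return np.array(labels), np.array(scores)


# Toy embeddings standing in for the output of an audio feature extractor.
embeddings = {
    "spk1_a.wav": [1.0, 0.0, 0.0],
    "spk1_b.wav": [0.9, 0.1, 0.0],
    "spk2_a.wav": [0.0, 1.0, 0.0],
}
trials = [
    (1, "spk1_a.wav", "spk1_b.wav"),  # same speaker
    (0, "spk1_a.wav", "spk2_a.wav"),  # different speakers
]

labels, scores = score_trials(trials, embeddings)
for label, score in zip(labels, scores):
    print(f"label={label} cosine_similarity={score:.4f}")
```

If the embeddings are good for speaker recognition, same-speaker pairs should score higher than different-speaker pairs, and a threshold over these scores can then be summarised with a metric such as the equal error rate. No video is involved because the trial is a comparison between two speakers' voices, not between audio and lip motion.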