Evaluations

Open cfoster0 opened this issue 4 years ago • 1 comment

We can start thinking about how to evaluate our models once they're trained. The simplest option would be contrastive loss over a held-out set with a fixed batch size (fixed because the loss depends on how many in-batch negatives each pair is compared against). Another would be to evaluate how well the CLAP score correlates with MOS (Mean Opinion Score), which is the gold-standard subjective eval in the audio NN literature. We could probably also try linear-probe training on the Google Speech Commands dataset.

cfoster0 • May 02 '21 15:05

Speak of the devil! Here's a new big benchmark that does just what I'm after.

https://arxiv.org/abs/2105.01051

cfoster0 • May 04 '21 01:05