go-containerregistry icon indicating copy to clipboard operation
go-containerregistry copied to clipboard

Speaker verification result

Open pierfale opened this issue 1 year ago • 4 comments

Hello,

Thank you for your work on WavLM. I try to reproduce the results but I have some difficulties.

First of all, I don't undestand exactly the difference between scores displayed in different places. For instance, on Vox1-O:

  • In WavLM paper (https://arxiv.org/pdf/2110.13900.pdf) the EER is 0.383%.
  • On the README of this repository (https://github.com/microsoft/UniSpeech#speaker-verification) the EER is 0.33%.
  • On the README of the downstream tasks (https://github.com/microsoft/UniSpeech/tree/main/downstreams/speaker_verification) the EER is 0.431%.

Moreover I tried to reproduce result from the fine-tuned checkpoint available on this repository (https://drive.google.com/file/d/1-aE1NfzpRCLxA4GUxX9ITI3F9LlbtEGP/view?usp=sharing).

I get the following result on vox1-O:

  • Without normalisation, I get EER = 0.558%
  • With s-norm, I get EER = 0.542%
  • with as-norm (cohort size = 600), I get EER = 0.505%

Do you have any more details to provide?

Thank you

pierfale avatar Jul 25 '23 11:07 pierfale

I can confirm that I obtained EER 0.558% for Vox1-O using WavLM large finetuned.

gozsoy avatar Feb 13 '24 16:02 gozsoy

Hello,

Thank you for your work on WavLM. I try to reproduce the results but I have some difficulties.

First of all, I don't undestand exactly the difference between scores displayed in different places. For instance, on Vox1-O:

  • In WavLM paper (https://arxiv.org/pdf/2110.13900.pdf) the EER is 0.383%.
  • On the README of this repository (https://github.com/microsoft/UniSpeech#speaker-verification) the EER is 0.33%.
  • On the README of the downstream tasks (https://github.com/microsoft/UniSpeech/tree/main/downstreams/speaker_verification) the EER is 0.431%.

Moreover I tried to reproduce result from the fine-tuned checkpoint available on this repository (https://drive.google.com/file/d/1-aE1NfzpRCLxA4GUxX9ITI3F9LlbtEGP/view?usp=sharing).

I get the following result on vox1-O:

  • Without normalisation, I get EER = 0.558%
  • With s-norm, I get EER = 0.542%
  • with as-norm (cohort size = 600), I get EER = 0.505%

Do you have any more details to provide?

Thank you

I also observed these differences. Have you fixed it?

gancx avatar Apr 25 '24 07:04 gancx

Same 0.558% and waiting for reply

RegulusBai avatar May 23 '24 19:05 RegulusBai