voxceleb_trainer icon indicating copy to clipboard operation
voxceleb_trainer copied to clipboard

Cannot achieve paper's RawNet3 results using an official recipe

Open happyjin opened this issue 2 years ago • 4 comments

Dear author,

I cannot reimplement the paper's results using the RawNet3 script, which should get EER 0.8932. I am wondering if the paper's result is wrong. Can you please upload a recipe so that we can reimplement the result on paper?

happyjin avatar Oct 11 '22 14:10 happyjin

Hi, can you share your results?

I cannot share the exact training recipe because it includes internal codes, which is one of the reasons why I shared trained weight parameters. However, model architecture is exactly the same and you should be getting similar results.

Jungjee avatar Oct 18 '22 11:10 Jungjee

Hi, we're encountering a similar issue. The pre-trained RawNet3 achieves an EER of 0.9809% with the full-length enroll and full-len test utterances. But when we train RawNet3 with the voxceleb1 & 2 dev set and use noise and reverberation addition as augmentation methods, the EER increases to 1.20% after 40 epochs. Besides, the EER rises to 1.4% after applying speed perturbation for voxceleb 2 dev.

During training, the mixedprec and distributed arguments are used to accelerate training.

Could you please provide some advice on how to address this? Thank you!

JunLi0514 avatar Dec 13 '23 03:12 JunLi0514

@JunLi0514 , hi, thanks for reporting your status. Speaking of EER 0.98%, did you follow the same setup by segmenting it into ten 4-second segments? If you input the full-length utterance, it would be different to what we did and hence result might be affected.

Note that I trained RawNet3 using another codebase. Only the model architecture has been updated to this repo (VoxCeleb_trainer).

FYI, due to my changed affiliation, I recently developed a RawNet3 reproducible recipe in ESPnet2, where I achieved EER of 0.73% with RawNet3 and it was reproducible several times when tested.

Jungjee avatar Dec 13 '23 16:12 Jungjee

Hi, @Jungjee , thank you for your quick reply! The work is impressive and the training recipe described in the paper is detailed. With your kind advice, we'll test the pretrained model with a duration of 4s and try training on ESPnet2 ^ ^

JunLi0514 avatar Dec 14 '23 01:12 JunLi0514