voxceleb_trainer Cannot achieve paper's RawNet3 results using an official recipe

Cannot achieve paper's RawNet3 results using an official recipe

Open happyjin opened this issue 2 years ago • 4 comments

Dear author,

I cannot reimplement the paper's results using the RawNet3 script, which should get EER 0.8932. I am wondering if the paper's result is wrong. Can you please upload a recipe so that we can reimplement the result on paper?

Oct 11 '22 14:10 happyjin

Hi, can you share your results?

I cannot share the exact training recipe because it includes internal codes, which is one of the reasons why I shared trained weight parameters. However, model architecture is exactly the same and you should be getting similar results.

Oct 18 '22 11:10 Jungjee

Hi, we're encountering a similar issue. The pre-trained RawNet3 achieves an EER of 0.9809% with the full-length enroll and full-len test utterances. But when we train RawNet3 with the voxceleb1 & 2 dev set and use noise and reverberation addition as augmentation methods, the EER increases to 1.20% after 40 epochs. Besides, the EER rises to 1.4% after applying speed perturbation for voxceleb 2 dev.

During training, the mixedprec and distributed arguments are used to accelerate training.

Could you please provide some advice on how to address this? Thank you!

Dec 13 '23 03:12 JunLi0514

@JunLi0514 , hi, thanks for reporting your status. Speaking of EER 0.98%, did you follow the same setup by segmenting it into ten 4-second segments? If you input the full-length utterance, it would be different to what we did and hence result might be affected.

Note that I trained RawNet3 using another codebase. Only the model architecture has been updated to this repo (VoxCeleb_trainer).

FYI, due to my changed affiliation, I recently developed a RawNet3 reproducible recipe in ESPnet2, where I achieved EER of 0.73% with RawNet3 and it was reproducible several times when tested.

Dec 13 '23 16:12 Jungjee

Hi, @Jungjee , thank you for your quick reply! The work is impressive and the training recipe described in the paper is detailed. With your kind advice, we'll test the pretrained model with a duration of 4s and try training on ESPnet2 ^ ^

Dec 14 '23 01:12 JunLi0514

voxceleb_trainer voxceleb_trainer copied to clipboard

Cannot achieve paper's RawNet3 results using an official recipe

voxceleb_trainer
voxceleb_trainer copied to clipboard