MP-SENet icon indicating copy to clipboard operation
MP-SENet copied to clipboard

Training on a more diverse dataset

Open nickhward opened this issue 6 months ago • 4 comments

Thank you for your paper!

I have been applying your model to a more diverse dataset consisting of approximately 3,000 speakers and around 1,000 hours of audio data. However, I have observed that the model's performance diminishes with such a diverse dataset. I am reaching out to ask if you have any recommendations or best practices for training the model to enhance its generalization capabilities, particularly when dealing with a wide variety of speakers and audio conditions.

I appreciate any advice or insights you could share.

Thank you!

nickhward avatar Aug 25 '24 16:08 nickhward