wespeaker icon indicating copy to clipboard operation
wespeaker copied to clipboard

Research and Production Oriented Speaker Verification, Recognition and Diarization Toolkit

Results 14 wespeaker issues
Sort by recently updated
recently updated
newest added

Thank you for uploading pre-trained ECAPA-TDNN model. For speaker diarization, the spectral clustering algorithm used by wespeaker uses the p-neighbor binarization scheme, and "p" should be choosed by people. I...

I'm benchmarking speaker embedding models, for speaker verifications, that are trained and tested on the [voxceleb1 dataset](https://www.robots.ox.ac.uk/~vgg/data/voxceleb/vox1.html). I am referring to the pre-trained models list [here](https://github.com/wenet-e2e/wespeaker/blob/master/docs/pretrained.md) but it looks like...

Hi, there is a recipe to follow on how to train on a new Language? Thanks a lot

Thanks for this repo.Excellent recognition results! Looking forward to open source code for Android platform deployment.

enhancement

greatly thanks to the project of speaker verification. in VoxSRC 2023, many team use QMF and ASnorm to improve the score, seems QMF is as good as ASnorm, if it...

enhancement

I'm currently in the process of training a model and have been tracking the loss and accuracy metrics. However, I've noticed that while I can calculate these metrics for the...

good first issue

How can i fine tune the pretrained model with my own audio files for speaker verification? There is no particular tutorial for it and i'm lost

It would be nice if you could `extract_embedding `with a `batch_size` parameter. Also accepting Numpy arrays or torch tensor instead of only file like types would be nice.

enhancement

Hi, I know there is [a plan to add 3D-Speaker](https://github.com/wenet-e2e/wespeaker/blob/efe6df2f0c6a5f76b3fa9092381d340a7c1e9a9b/ROADMAP.md?plain=1#L18). Is there any date for this issue? Are you planning to release pre-trained models on 3D-Speaker ?

Hi, is it possible to make a batch containing M speaker and N utterances for each speaker?

good first issue