speaker-diarization topic
diart
A python package to build AI-powered real-time audio applications
pytorch_xvectors
Deep speaker embeddings in PyTorch, including x-vectors. Code used in this work: https://arxiv.org/abs/2007.16196
TDNN
Time delay neural network (TDNN) implementation in Pytorch using unfold method
wespeaker
Research and Production Oriented Speaker Verification, Recognition and Diarization Toolkit
Speaker-Diarization
speaker diarization by uis-rnn and speaker embedding by vgg-speaker-recognition
WatBot
An Android ChatBot powered by IBM Watson Services (Assistant V1, Text-to-Speech, and Speech-to-Text with Speaker Recognition) on IBM Cloud.
DNC
Discriminative Neural Clustering for Speaker Diarisation
SpeakerDiarization_RNN_CNN_LSTM
Speaker Diarization is the problem of separating speakers in an audio. There could be any number of speakers and final result should state when speaker starts and ends. In this project, we analyze giv...
UHV-OTS-Speech
A data annotation pipeline to generate high-quality, large-scale speech datasets with machine pre-labeling and fully manual auditing.
AliMeeting
The project is associated with the recently-launched ICASSP 2022 Multi-channel Multi-party Meeting Transcription Challenge (M2MeT) to provide participants with baseline systems for speech recognition...