speaker-recognition topics

NeMo

11.6k

Stars

2.4k

Forks

194

Watchers

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

NVIDIA

asr

deep-learning

language-model

machine-translation

speechbrain

8.0k

Stars

1.3k

Forks

Watchers

A PyTorch-based Speech Toolkit

speechbrain

asr

audio

audio-processing

deep-learning

uis-rnn

1.5k

Stars

318

Forks

Watchers

This is the library for the Unbounded Interleaved-State Recurrent Neural Network (UIS-RNN) algorithm, corresponding to the paper Fully Supervised Speaker Diarization.

google

clustering

machine-learning

speaker-diarization

speaker-recognition

chatbot-watson-android

193

Stars

182

Forks

Watchers

An Android ChatBot powered by Watson Services - Assistant, Speech-to-Text and Text-to-Speech on IBM Cloud.

IBM-Cloud

android

android-studio

chatbot

conversation

pyannote-audio

5.3k

Stars

705

Forks

Watchers

Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding

pyannote

overlapped-speech-detection

pretrained-models

pytorch

speaker-change-detection

SincNet

1.1k

Stars

259

Forks

Watchers

SincNet is a neural architecture for efficiently processing raw audio samples.

mravanelli

artificial-intelligence

asr

audio

audio-processing

The SpeechBrain project aims to build a novel speech toolkit fully based on PyTorch. With SpeechBrain users can easily create speech processing systems, ranging from speech recognition (both HMM/DNN a...

speechbrain

beamforming

deep-learning

deeplearning

librispeech