wavlm topic
s3prl
Self-Supervised Speech Pre-training and Representation Learning Toolkit
wespeaker
Research and Production Oriented Speaker Verification, Recognition and Diarization Toolkit
unilm
Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
StyleTTS2
StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models
Singing-Vocal-Beat-Tracking
This repo contains the source code of the first deep learning-based singing voice beat tracking system. It leverages WavLM and DistilHuBERT pre-trained speech models to create vocal embeddings and trai...
audiocodecs
A collection of audio codecs with a standardized API
focalcodec
A low-bitrate single-codebook 16 / 24 kHz speech codec based on focal modulation
discrete-wavlm-codec
A neural speech codec based on discrete WavLM representations