hubert topic
s3prl
Self-Supervised Speech Pre-training and Representation Learning Toolkit
MiniASR
A mini, simple, and fast end-to-end automatic speech recognition toolkit.
so-vits-svc-fork
so-vits-svc fork with realtime support, improved interface and more features.
Speech-Emotion-Recognition
An implementation of Speech Emotion Recognition based on the HuBERT model, trained with PyTorch and the HuggingFace framework, and fine-tuned on the RAVDESS dataset.
self-supervised-phone-segmentation
Phoneme segmentation using pre-trained speech models
Singing-Vocal-Beat-Tracking
This repo contains the source code of the first deep learning-based singing voice beat tracking system. It leverages WavLM and DistilHuBERT pre-trained speech models to create vocal embeddings and trai...
AI-Cover-Song
Cover Song Powered by SoftVC VITS
ShiftSER
[ICASSP 2023] Mingling or Misalignment? Temporal Shift for Speech Emotion Recognition with Pre-trained Representations
Map-Mix
The official implementation of the method discussed in the paper Improving Spoken Language Identification with Map-Mix (work accepted at ICASSP 2023)