speech-processing topic
SpeechGen
《SpeechGen: Unlocking the Generative Power of Speech Language Models with Prompts》
Praaline
Praaline is an open-source system to manage, annotate, visualise and analyse spoken language corpora
WebSpeechAnalyzer
JS speech analyzer for fast speech analysis and labeling
formantanalyzer.js
Extract formant features such as frequency, power, energy, and bandwidth of formants at syllable or word level from audio sources in a web browser using WebAudio API.
Nested-U-Net-based-Real-time-Speech-Enhancement-Mobile-App
Real-time speech enhancement mobile app using Nested U-Net
resemble-enhance
AI powered speech denoising and enhancement
PNCC
A implementation of Power Normalized Cepstral Coefficients: PNCC
itsp
Introduction to Speech Processing
llm-tse
Typing to Listen at the Cocktail Party: Text-Guided Target Speaker Extraction (LLM-TSE)
whisper-auto-transcribe
Auto transcribe tool based on whisper