speech-processing topic
whisper-timestamped
Multilingual Automatic Speech Recognition with word-level timestamps and confidence
Shelf
a Wide Shelf for AI and Data Science | Resources đ
SERAB
SERAB: a multi-lingual benchmark for speech emotion recognition
NLP-Guide
Natural Language Processing (NLP). Covering topics such as Tokenization, Part Of Speech tagging (POS), Machine translation, Named Entity Recognition (NER), Classification, and Sentiment analysis.
awesome-speech-emotion-recognition
đ Awesome lists about Speech Emotion Recognition
bob
Bob is a free signal-processing and machine learning toolbox originally developed by the Biometrics group at Idiap Research Institute, in Switzerland. - Mirrored from https://gitlab.idiap.ch/bob/bob
LIUM
Scripts for LIUM SpkDiarization tools
spin
Official code for Interspeech 2023 paper "Self-supervised Fine-tuning for Improved Content Representations by Speaker-invariant Clustering"
DiscordSpeechBot
A speech-to-text bot for discord with music commands and more using NodeJS. Ideally for controlling your Discord server using voice commands, can also be useful for hearing-impaired people.
SpeechPrompt-v2
ăSpeechPrompt v2: Prompt Tuning for Speech Classification TasksăSpeech processing with prompting paradigm