aes67-linux-daemon icon indicating copy to clipboard operation
aes67-linux-daemon copied to clipboard

Real time audio transcription with OpenAI's Whisper

Open bondagit opened this issue 7 months ago • 1 comments

This issue is to track the support of the AES67 daemon for real-time transcription of audio streams using OpenAI's Whisper, integrated through Whisper.cpp, a high-performance C/C++ inference of Whisper. The transcription feature enables speech-to-text conversion of daemon's configured Sinks with good robustness and accuracy, making it a valuable addition for multimedia and broadcast applications. Audio transcription feature has been integrated while maintaining robust performance in multi-sink setups by leveraging a multi-threaded architecture. See branch asr-whisper

bondagit avatar Jun 09 '25 11:06 bondagit

In case you are interested in Whisper on ALSA, checkout this separate project: https://github.com/bondagit/whisper-alsa

bondagit avatar Jun 13 '25 13:06 bondagit