whisper.cpp
whisper.cpp copied to clipboard
Port of OpenAI's Whisper model in C/C++
There seems to be significant interest for a voice assistant application of Whisper, similar to "Ok, Google", "Hey Siri", "Alexa", etc. The existing [stream](https://github.com/ggerganov/whisper.cpp/tree/master/examples/stream) tool is not very applicable for...
I have made binding for almost all of the functions and I am trying to get this working in Python but I am encountering errors. Some of this code is...
Thanks for the code! This is great! This script will: 1. grab 30s of audio from bbc radio 2. transcribe it 3. spit out transcribed text 4. repeat `bbc_blaster.sh` ```...
I've locally patched up recent binaries with this manifest ```xml true SegmentHeap ``` to make them use Windows 10/11+ new segmented heap (compatibility is for running them outside the vista...
ref #68, #188, #197 - Add the "max_initial_timestaamp" token logic from OpenAI - Disallow sampling timestamps that are in the past - Add fallback strategy when the timestamp token sampling...
When running whisper.cpp with e.g. `--max-len 77` I get some weird time codes. It does not happen when not using `--max-len`. Examples: ```` [00:34:35.820 --> 00:34:36.820] You built that with...
Is there a way to cancel a transcript that is in progress programmatically? Calling `whisper_free` results in the following error: ``` ggml_new_tensor_impl: not enough space in the context's memory pool...
I used whisper.cpp to process a whole tv series. It's detecting about 99% of the words said but subtitles are not getting any gap/interval between them and I don't want...
Whenever I run `make` I see the following output: ``` ❯ make Makefile:21: Your arch is announced as x86_64, but it seems to actually be ARM64. Not fixing that can...
Might be a stretch, but would it be possible to label different speakers if audio has >1 person talking? This would come handy for conference recordings with multiple presenters, etc.