whisper.cpp

Port of OpenAI's Whisper model in C/C++

Results 525 whisper.cpp issues

There seems to be significant interest in a voice assistant application of Whisper, similar to "Ok, Google", "Hey Siri", "Alexa", etc. The existing [stream](https://github.com/ggerganov/whisper.cpp/tree/master/examples/stream) tool is not very applicable for...

ideas

I have made bindings for almost all of the functions and I am trying to get this working in Python, but I am encountering errors. Some of this code is...

bindings

Thanks for the code! This is great! This script will: 1. grab 30s of audio from bbc radio 2. transcribe it 3. spit out transcribed text 4. repeat `bbc_blaster.sh` ```...

ideas

I've locally patched up recent binaries with this manifest ```xml true SegmentHeap ``` to make them use Windows 10/11+ new segmented heap (compatibility is for running them outside the vista...

performance

ref #68, #188, #197 - Add the "max_initial_timestamp" token logic from OpenAI - Disallow sampling timestamps that are in the past - Add fallback strategy when the timestamp token sampling...
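
The first two rules in the item above can be illustrated with a minimal sketch. This is not the code from that change: the function name `mask_timestamp_logits` and all of its parameters are hypothetical, and it assumes timestamp tokens occupy the tail of the vocabulary starting at index `timestamp_begin`, with logits held in a plain Python list.

```python
import math


def mask_timestamp_logits(logits, timestamp_begin, last_timestamp_token=None,
                          is_first_step=False, max_initial_index=None):
    """Return a copy of `logits` with disallowed timestamp tokens set to -inf.

    All names here are illustrative assumptions, not whisper.cpp identifiers.
    """
    masked = list(logits)

    # Rule 1: timestamps must not move backwards, so every timestamp token
    # below the most recently emitted one is made unsampleable.
    if last_timestamp_token is not None:
        for tok in range(timestamp_begin, last_timestamp_token):
            masked[tok] = -math.inf

    # Rule 2: on the first decoding step, cap how late the initial timestamp
    # may be (the "max initial timestamp" idea), expressed as a token offset.
    if is_first_step and max_initial_index is not None:
        for tok in range(timestamp_begin + max_initial_index + 1, len(masked)):
            masked[tok] = -math.inf

    return masked
```

A sampler would then pick the next token from the masked logits, so a timestamp earlier than the previous one can never be chosen.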

When running whisper.cpp with e.g. `--max-len 77` I get some weird time codes. It does not happen when not using `--max-len`. Examples: ```` [00:34:35.820 --> 00:34:36.820] You built that with...

bug

Is there a way to cancel a transcript that is in progress programmatically? Calling `whisper_free` results in the following error: ``` ggml_new_tensor_impl: not enough space in the context's memory pool...

enhancement

I used whisper.cpp to process a whole tv series. It's detecting about 99% of the words said but subtitles are not getting any gap/interval between them and I don't want...

Whenever I run `make` I see the following output: ``` ❯ make Makefile:21: Your arch is announced as x86_64, but it seems to actually be ARM64. Not fixing that can...

Might be a stretch, but would it be possible to label different speakers if the audio has >1 person talking? This would come in handy for conference recordings with multiple presenters, etc.