WhisperKit
On-device Speech Recognition for Apple Silicon
As with OpenAI's Whisper, is it possible to pass a text prompt to improve the quality of the transcript? See https://cookbook.openai.com/examples/whisper_prompting_guide#pass-names-in-the-prompt-to-prevent-misspellings Laurent
Implement tests to transcribe long audio files (at least several minutes worth) and measure the memory and latency over time. This is to guard against memory leaks or slowdowns potentially...
The goal is to leverage the high-quality word-level timestamps added in #38 as anchors to reliably seek the audio buffer forward at a higher frequency compared to current behavior: -...
Addresses #13 + adds audio device view in both sections. 1. New delete model button 2. Restart download of model if connection lost 3. Move audio input selection between the...
Occasionally I'm seeing an `index out of range` crash at `segmentLogProbs[index]` after a long period of silence. https://github.com/argmaxinc/WhisperKit/blob/main/Sources/WhisperKit/Core/TextDecoder.swift#L518-L521 `Swift/ContiguousArrayBuffer.swift:600: Fatal error: Index out of range` Two ways I could...
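One general way to avoid this kind of trap is bounds-checked indexing. The sketch below is not WhisperKit code: the `safe` subscript and `sumLogProbs` are hypothetical helpers, and the log-probability values are made up for illustration. Only the name `segmentLogProbs` comes from the report above.

```swift
// Hypothetical helper, not part of WhisperKit: bounds-checked array access.
extension Array {
    /// Returns the element at `index`, or nil when `index` is out of range.
    subscript(safe index: Int) -> Element? {
        indices.contains(index) ? self[index] : nil
    }
}

// Illustrative stand-in for the per-token log probabilities; values are made up.
let segmentLogProbs: [Float] = [-0.12, -0.48, -0.07]

// Sums the log probabilities at the requested indices, skipping any index
// that falls outside the array instead of trapping.
func sumLogProbs(_ logProbs: [Float], at indices: [Int]) -> Float {
    indices.reduce(Float(0)) { total, index in
        total + (logProbs[safe: index] ?? 0)
    }
}

let inRange = sumLogProbs(segmentLogProbs, at: [0, 2])
let withStale = sumLogProbs(segmentLogProbs, at: [0, 2, 7]) // index 7 is ignored
```

Skipping an out-of-range index keeps the app running, but it can also hide whatever produced the stale index, so logging the skipped case may be worthwhile.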
I am getting this error when trying to start WhisperKit in any simulator. Can someone explain what might be causing it and how to fix it? *** Terminating app due to uncaught...
How can I fix this issue? Task . HTTP load failed, 0/0 bytes (error code: -1200 [3:-9816]) Task . finished with error [-1200] Error Domain=NSURLErrorDomain Code=-1200 "An SSL error has occurred...
Error message: Could not launch “WhisperAX” Domain: IDELaunchErrorDomain Code: 20 Recovery Suggestion: Runningboard has returned error 5. Please check the system logs for the underlying cause of the error....