WhisperKit
WhisperKit copied to clipboard
On-device Speech Recognition for Apple Silicon
Hey guys! This looks great, unfortunately I'm having issues loading the models (both in my own code and the sample app) I'm running this on an M1 Macbook Pro. Many...
the example is unable to run on iphone 11 pro. (The example is running good on Mac m1 max ) The following is the screen shot on iphone 11 pro....
It looks like the model is deleted when I use FIleManager removeAt method but when I re-run the project the deleted model appears again. FileManager.default.removeItem(at: URL.init(string: "file://" + "\(path)")!)
Hi folks, just wanted to check in and ask what would be entailed in adding support for older mac versions, such as 13.0?
the advantage of this project is that it uses CoreML for a performance gain, so showing benchmarks would solidify how much this advantage is
It would be worth adding support for ReactNative apps using Native Modules and expose Swift APIs to JS.
Segment level timestamps look good, great work guys. Are token level timestamps currently supported somehow, or on the roadmap?
https://github.com/argmaxinc/WhisperKit/blob/fed90c7c0727eb9f460d2a1226e0cbd5abf75141/Sources/WhisperKit/Core/AudioProcessor.swift#L197-L217 Creating an `AVAudioPCMBuffer` for the whole input audio buffer can easily surpass iOS memory limits. Attempting to transcribe a 44100hz, 2 channel, ~1hr long video crashes on iOS due...
I'm trying the demo app on a MacBook Pro with Apple M1 Pro and 16 GB memory. The `large-v3_turbo_1049MB` model has been specializing for more than 30 minutes, but `aned`...
Models are taking really good time to download. If the WiFI is OFF for a second it gets struck - it would be great if the example has two more...