WhisperKit issues

Fix crash with mic device sample rate mismatch

Previously, the inputNode used the sample rate of the hardware the app was running on. When selecting a microphone that is connected to the device but has...

ZachNagengast

WhisperKit CLI cleanup

Before taking on https://github.com/argmaxinc/WhisperKit/issues/36 I decided to do a little cleanup in the CLI:
- moved command line arguments to a separate `WhisperKitArguments` struct
- extracted a separate `transcribe` subcommand...

jkrukowski

The app “WhisperAX” has been killed by the operating system because it is using too much memory.

**The app crashes after recording a few seconds of audio. It is running on an iPhone 12 mini that has been cold restarted, with Large-v2_1050MB.** The app “WhisperAX”...

zxl777

Add `Progress` to `WhisperKit`

Currently there doesn't seem to be a way to get the current transcription progress to display. This adds a `Progress` to WhisperKit for easily displaying transcription progress in a ProgressView...

finnvoor

Updated swift-transformers and tokenizer changes

In this PR:
- updated to version 0.1.3 of `swift-transformers`
- added a CLI parameter for the tokenizer config download path

jkrukowski

Implement test data-driven `unsupportedModelDeviceCombination` at init

After specifying a minimum OS version of macOS 13 and iOS 16, there is still a large matrix of possible model-device configurations for deployment. Devices have varying capabilities across:
- **Available RAM:**...

atiorh

how to fix "Ambiguous use of 'transcribe(audioPath:decodeOptions:callback:)'"

I have tried many approaches, but the compiler still shows: Ambiguous use of 'transcribe(audioPath:decodeOptions:callback:)'. Here is the code: ```` func stopRecording() { audioRecorder?.stop() if let url = audioRecorder?.url { transcribeAudio(audioPath: url.path) } } func transcribeAudio(audioPath:...

zacfire

Added MLX Audio Encoder

This PR adds an MLX Audio Encoder. The implementation is based on the `AudioEncoder` from the `mlx-examples` repository. To make sure the audio encoder works as expected, I have added the...

jkrukowski

VAD audio chunking

This PR introduces audio chunking with VAD. The VAD is used to detect speech segments in the audio file and then the audio is split into chunks based on the...

jkrukowski

MLX model support

Draft PR for the early stages of supporting MLX-based Whisper models directly in WhisperKit. (To be updated)

Initial TODOs:
- [x] Set up Swift package structure
- [x] MLXFeatureExtractor using...

ZachNagengast
