WhisperKit issues

Excessive Hypothesis Words in Non-English Audios

1

Hello, I’m experiencing an issue with transcription in non-English streams, particularly in Russian, where the number of hypothesis words generated is disproportionately large compared to confirmed words. The hypothesis words...

rk-helper

triaged

MLX performance improvement with in-place KVCache

#124 adds MLX support. We need to adopt the [in-place KV cache update](https://github.com/ml-explore/mlx-swift-examples/blob/main/Libraries/LLM/KVCache.swift#L9) pattern to improve performance.

atiorh

Decreasing Speed and Delayed Confirmation in Stream Transcription Over Time

7

I’ve been using WhisperKit for real-time stream transcription in a project, and I’ve noticed that as time progresses, particularly after 20-30 minutes of continuous use, the transcription speed begins to...

gavin1818

Excessive Loading Time for Each Transcription with whisperkit-cli

2

**Environment:** - MacBook Pro 14 (2021) 16GB - Apple M1 Pro - macOS 14.2.1 (23C71) **Issue Description:** I compiled whisperkit-cli using the whisperkittools pipeline. Subsequently, I noticed that each time...

NextDoorLaoHuang-HF

resampleBuffer may failed because the calculated capacity is less than 1

2

https://github.com/argmaxinc/WhisperKit/blob/3ebfa142a0e181668882e8e1c54088a528e2907b/Sources/WhisperKit/Core/Audio/AudioProcessor.swift#L416-L426

Josscii

bug

How to support auto language

1

In OpenAI’s Whisper model, setting the language to “auto” allows for automatic language detection, enabling multilingual transcription. However, in WhisperKit, this “auto” setting doesn’t seem to function as expected. Could...

kimilee7788

mp3 (attached) results in "Failed to process audio buffer" issue

2

Hi all, This mp3 file seems to trip WhisperKit (and latest Testflight WhisperAX on macOS) up, just wanted to let you know :) I haven't dived much further though mp3...

iandundas

bug

triaged

"Invalid semantic version string" when referencing version tag in SPM package

4

When referencing WhisperKit from a SPM package as a dependency, it does not recognize the "v" prefixed version as a version string (for both from and exact). The error in...

wooster

documentation

iPhone 12 Mini Crashes During Long Audio Transcription in WhiserAX App

The iPhone 12 mini crashed while using the latest version of the WhiserAX app on TestFlight to transcribe an 11-hour audio import. Selected model: whisper-small.en

zxl777

WhisperKit
WhisperKit copied to clipboard

Metadata

Excessive Hypothesis Words in Non-English Audios

MLX performance improvement with in-place KVCache

Decreasing Speed and Delayed Confirmation in Stream Transcription Over Time

Excessive Loading Time for Each Transcription with whisperkit-cli

resampleBuffer may failed because the calculated capacity is less than 1

How to support auto language

mp3 (attached) results in "Failed to process audio buffer" issue

"Invalid semantic version string" when referencing version tag in SPM package

iPhone 12 Mini Crashes During Long Audio Transcription in WhiserAX App

← Metadata

Owner

Metadata

WhisperKit WhisperKit copied to clipboard

Metadata

← Metadata

Owner

Metadata

WhisperKit
WhisperKit copied to clipboard