mlx-audio

Refactor STT API (common functions)

Open Blaizzy opened this issue 9 months ago • 2 comments

We need to refactor our Speech-to-Text (STT) API to extract common functions, eliminate code duplication, and create a more consistent interface across different components.

Requirements

  • Identify and extract common functions used across the STT pipeline
  • Create a shared utilities module for audio processing functions
  • Standardize function signatures and return types
  • Separate core functionality from model-specific implementations
  • Ensure backward compatibility with existing API consumers
  • Improve error handling and logging
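One way the requirements above could fit together is sketched below. This is only an illustration, not the code that landed in the project: every name in it (`STTOutput`, `Segment`, `normalize_audio`, `BaseSTTModel`, `EchoModel`) is hypothetical, and it uses plain Python lists instead of real audio arrays so that it runs with the standard library alone.

```python
import logging
from abc import ABC, abstractmethod
from dataclasses import dataclass, field
from typing import List, Sequence

logger = logging.getLogger("stt_sketch")  # hypothetical logger name


@dataclass
class Segment:
    """One timed chunk of a transcript (hypothetical type)."""
    start: float
    end: float
    text: str


@dataclass
class STTOutput:
    """Standardized return type shared by every model (hypothetical type)."""
    text: str
    segments: List[Segment] = field(default_factory=list)


def normalize_audio(samples: Sequence[float]) -> List[float]:
    """Shared utility: scale samples so the peak absolute value is 1.0."""
    if len(samples) == 0:
        # Fail early with a clear message instead of deep inside a model.
        raise ValueError("audio is empty")
    peak = max(abs(s) for s in samples)
    if peak == 0.0:
        return [0.0 for _ in samples]  # silent input stays silent
    return [s / peak for s in samples]


class BaseSTTModel(ABC):
    """Core pipeline; subclasses supply only the model-specific step."""

    sample_rate: int = 16000  # assumed default for this sketch

    def generate(self, samples: Sequence[float]) -> STTOutput:
        audio = normalize_audio(samples)
        logger.debug("transcribing %d samples", len(audio))
        return self._transcribe(audio)

    @abstractmethod
    def _transcribe(self, audio: List[float]) -> STTOutput:
        """Model-specific decoding, implemented by each subclass."""


class EchoModel(BaseSTTModel):
    """Stand-in model that reports the sample count instead of decoding."""

    def _transcribe(self, audio: List[float]) -> STTOutput:
        duration = len(audio) / self.sample_rate
        text = f"{len(audio)} samples"
        return STTOutput(text=text, segments=[Segment(0.0, duration, text)])
```

With this layout, callers always invoke `generate` and always receive an `STTOutput`, so existing consumers could be kept working through a thin wrapper while each model only overrides `_transcribe`.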

Blaizzy, May 09 '25 18:05

Done in #161

Blaizzy, May 24 '25 11:05