
SparkTTS Voice cloning (Wav2vec)

Open • Blaizzy opened this issue 9 months ago • 0 comments

  • [x] Add Wav2vec model as STT (ASR and Audio classification)
  • [ ] The torch mel spectrogram seems to work better than the current one
  • [ ] Fix model skipping the first few words

Blaizzy • May 08 '25 01:05