mlx-audio

SparkTTS Voice cloning (Wav2vec)

Open • Blaizzy opened this issue 9 months ago • 0 comments

[x] Add Wav2vec model as STT (ASR and Audio classification)
[ ] The torch mel spectrogram seems to work better than the current one
[ ] Fix model skipping the first few words

May 08 '25 01:05 Blaizzy