Prince Canuma

Results 63 issues of Prince Canuma

### Summary The latest mlx-lm (v0.28.1) introduces a regression in `generate_step` that breaks models using `input_embeddings`, specifically affecting the Voxtral STT pipeline in mlx-audio. ### The Bug In the prefill...

https://x.com/jkeshet/status/1986771354104877296?s=46