Prince Canuma
I saw the same example! Not sure it's functional here yet, but it is possible, and it's at the top of the list once I finish the Orpheus port.
That would be awesome! We can achieve it with STT
Yes, training Orpheus is a possibility. I believe it can be done with the help of MLX-LM and some high-level utils for processing the audio output. In general, I'm thinking...
> NO idea how you keep up with so many things to do! :)

Neither do I 🤣🙌🏽
> Literally so many new things coming out!
>
> On a sidenote does mlx-audio implement anything like this:
> https://github.com/freddyaboulton/fastrtc
>
> Specifically @reach_vb mentioned this post on X:...
> Just saw one of the examples is also real time object detection. I wonder if it could be implemented in MLX-VLM specifically for on screen bounding boxes 🤔
>
> ...
Thanks, much needed! Will keep that in mind :)
@charmaineem here is a great section for the docs: inference examples for each model.
Hey @jrp2014, thank you very much! What is your proposed solution? To clarify, the need to trust the code and the deprecation warnings come from HF transformers. Regarding the models that...
Please share the command you used and the version of MLX-VLM.