Prince Canuma
I opened a new issue for the pauses so we can keep track of it and return to it later.
Hey @JoeJoe1313 You are correct! ✅ Multimodal models should be converted using mlx-vlm, not mlx-lm. Please feel free to re-upload using mlx-vlm; if you can't, just ping me and...
This is happening because new models are migrating to Jinja files instead of JSON for chat templates. I fixed it here: #376. You can install from source for now; it...
This is awesome! I have some ideas; let's collaborate on a PR.
To solve #136, we can load the KV cache ([attention sink](https://arxiv.org/abs/2309.17453)) and compute values only for the new image and prompt tokens. Note: 1. Save each image hash in the cached prompts/features for...
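The image-hash idea above can be sketched roughly like this. This is a minimal, hypothetical illustration, not the mlx-vlm API: the cache key is a SHA-256 of the raw image bytes, so re-submitting the same image reuses the cached features instead of re-encoding (the same keying scheme would apply to cached KV state).

```python
import hashlib
from typing import Callable

# Hypothetical sketch: reuse cached image features keyed by a hash of the
# image bytes, so only genuinely new images trigger recomputation.
_feature_cache: dict[str, list[float]] = {}

def _image_hash(image_bytes: bytes) -> str:
    """Stable cache key for an image: SHA-256 of its raw bytes."""
    return hashlib.sha256(image_bytes).hexdigest()

def get_image_features(
    image_bytes: bytes,
    encode: Callable[[bytes], list[float]],
) -> list[float]:
    """Return cached features if this exact image was seen before,
    otherwise encode it once and store the result."""
    key = _image_hash(image_bytes)
    if key not in _feature_cache:
        _feature_cache[key] = encode(image_bytes)
    return _feature_cache[key]
```

Hashing the bytes (rather than a filename) makes the cache robust to renamed or re-uploaded copies of the same image; a second call with identical bytes is a pure cache hit.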
@Licyldj this should be fixed in v0.2.2
Hey @fakerybakery Thanks for reaching out and suggesting it! Please feel free to suggest new models anytime. ❤️ It's definitely on the roadmap 🚀 I will start on the TTS module initially...
Thanks! I will look into it. There is another issue reporting the same problem.
I wonder about that as well, but being closer to the project helps. If it grows in popularity, we can consider a separate repo. cc: @lucasnewman