Nightguarder
Nightguarder
Sorry Not familiar with ``nix``. Will look into it. I just wanted to quickly get working on the project.
Hi, I have successfully added a new Feature: **Testing custom MLX models** Can Someone please clone & run my fork to verify downloading a larger model like [mlx-community/gpt-oss-20b-MXFP4-Q8](https://huggingface.co/mlx-community/gpt-oss-20b-MXFP4-Q8/tree/main)? I don’t...
I hope this is something we wanted. Currently only for testing purposes.
Not sure why my VSCode Prettier auto `prettified` all the files I’ve changed. I will probably create a new clean PR where I only change the required code blocks, to...
> Looks good! I wonder if we should directly add the model to the model cards instead of a separate KNOWN_MODELS but there's wider questions to be answered in there....
> Ok - gpt-oss-20b-MXFP4-Q8 did not work, but the download was completely fine, seems like an upstream problem. Yes I see the erorr. this might be more difficult than I...
Ok it’s working. GPT-OSS- model loaded. However I had to adedd _TEMPORARY_ overrides as in my commit: [2e446ab](https://github.com/exo-explore/exo/pull/937/commits/2e446ab3969904f77d2cfdeb9c7f6efb8d4c0637) Not ideal, we need to wait for official mlx support version.
GPT-oss-20b has no chat_template.jinja resulting in artifacts and instructions appearing in chat: ``` QUERY Hello EXO 09:25:43 TTFT 555ms•70.7 tok/s analysisWe need to be helpful, concise, no reasoning inside answer....
Hi, I have removed the specific memory overrides for `gpt-oss-20b` model. Can I now request a review from Developer / Maintainer for this PR: #937 ? Thank you
Hi, I’ve been sort of implementing this on my own fork of exo. How can I help with this?