Eric Buehler
Eric Buehler
@scottwey thanks you for the testing! Great to see the 100% -> ~1% utilization :). If you could resolve the conflicts, we can absolutely merge.
@misureaudio that makes sense. I'll merge a nice solution!
@misureaudio @DenisBobrovskiy I just merged #878 which only compiles & runs the Marlin kernels if the compute cap is appropriate, can you please confirm if it works?
@misureaudio this was fixed in #944 .
@gqf2008 @chrootchad can you please share the command you used to reproduce this?
@maslovw thanks for the issue! I opened #846 which should fix the build errors. Can you please run ``` git pull git switch metal_f8_bugfix ``` And then retry the build?
@maslovw thank you for confirming! I merged #846, please feel free to reopen.
@Sherlock-Holo I pushed [87a7c23](https://github.com/EricLBuehler/mistral.rs/commit/87a7c2368a61767a55e0f361b29c66792219c1c4) which fixes the loading. It looks like the chat template for these models in the GGUF file migth be incorrect, as it does not match the...
Thanks for the issue and the example. I will take a look!
Friendly ping @Jeadie