exo
exo copied to clipboard
[feature] deepseek-r1 Dynamic Quantized versions add
The dynamic quantizd version of DeepSeek-R1 is in GGUF format, but the support for llama.cpp is still in working, see: #167 . You could use MLX and tinygard model in exo currently.