mergekit
mergekit copied to clipboard
Error at MoE Qwen 1.5B
mergekit-moe config.yaml merge --copy-tokenizer --device cuda --low-cpu-memory --trust-remote-code
ERROR:root:No output architecture found that is compatible with the given models.
ERROR:root:All supported output architectures:
ERROR:root: * Mixtral
ERROR:root: * DeepSeek MoE
ERROR:root: * Qwen MoE
I have the latest version of mergekit, I use only Qwen2 models and only 1.5B weight, no custom code.