mergekit icon indicating copy to clipboard operation
mergekit copied to clipboard

Error at MoE Qwen 1.5B

Open ehristoforu opened this issue 6 months ago • 2 comments

mergekit-moe config.yaml merge --copy-tokenizer --device cuda --low-cpu-memory --trust-remote-code
ERROR:root:No output architecture found that is compatible with the given models.
ERROR:root:All supported output architectures:
ERROR:root:  * Mixtral
ERROR:root:  * DeepSeek MoE
ERROR:root:  * Qwen MoE

I have the latest version of mergekit, I use only Qwen2 models and only 1.5B weight, no custom code.

ehristoforu avatar Aug 12 '24 17:08 ehristoforu