crabml icon indicating copy to clipboard operation
crabml copied to clipboard

Support Qwen models

Open flaneur2020 opened this issue 1 year ago • 1 comments

as described in https://arxiv.org/pdf/2309.16609.pdf

the architectural differences between llama are:

Screenshot 2024-03-31 at 23 06 47

references: https://github.com/ml-explore/mlx-examples/blob/main/llms/mlx_lm/models/qwen.py

flaneur2020 avatar Mar 31 '24 15:03 flaneur2020

#179 had added the qwen2 support, however we can close this issue after the chat template for qwen2 got adjusted

flaneur2020 avatar Apr 18 '24 04:04 flaneur2020