AQLM icon indicating copy to clipboard operation
AQLM copied to clipboard

Compatibility with Qwen models

Open AIpplier opened this issue 3 months ago • 1 comments

Hi,

I just wanted to confirm whether AQLM’s quantization methods can be applied to Qwen models, since they are part of the LLaMA-family architecture. Are there any known limitations or additional steps required for using AQLM with Qwen checkpoints?

Thanks!

AIpplier avatar Oct 16 '25 12:10 AIpplier

Hi, @AIpplier.

Llama and Qwen are very similar in terms of architecture, and it is likely that the code script working for Llama would work for Qwen without significant changes.

The potential cause of issues could be changes of Llama/Qwen implementations in more recent transformers versions. Specfically, RoPE is subject to major changes.

Godofnothing avatar Oct 17 '25 12:10 Godofnothing