LLM-Pruner
LLM-Pruner copied to clipboard
Adaption for Qwen3
Thank you for your solid work. I would like to ask if the code will be suitable for Qwen3 models, which added q_norm and k_norm in self_attention and would lead to not pruning self_attn layers.