PaddleNLP
PaddleNLP copied to clipboard
[LLM] Support QWEN BlockAttention && PTQ
PR types
New features
PR changes
Models
Description
Support QWEN BlockAttention && PTQ