
[Windows] Qwen1.5-7B performance optimization

Open juan-OY opened this issue 1 month ago • 0 comments

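Our workload summarizes multiple long input responses at run time. We would like further optimization of first-token latency and rest-token (every token after the first) latency for the Qwen1.5-7B model.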
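To make the two metrics concrete, here is a minimal sketch of how first-token and rest-token latency could be measured over any token stream. It is not taken from ipex-llm: `measure_token_latency` and `fake_stream` are hypothetical names, and `fake_stream` only simulates a model with `time.sleep`. With a real model you would pass in whatever streaming iterator your runtime provides.

```python
import time
from typing import Iterable, Iterator, Tuple


def measure_token_latency(tokens: Iterable[str]) -> Tuple[float, float, int]:
    """Return (first_token_seconds, mean_rest_token_seconds, token_count).

    first-token latency: time from the start of iteration to the first token
    (dominated by prefill on long inputs).
    rest-token latency: mean gap between consecutive later tokens (decode).
    """
    start = time.perf_counter()
    first_latency = 0.0
    last = start
    rest_total = 0.0
    count = 0
    for _ in tokens:
        now = time.perf_counter()
        if count == 0:
            first_latency = now - start
        else:
            rest_total += now - last
        last = now
        count += 1
    mean_rest = rest_total / (count - 1) if count > 1 else 0.0
    return first_latency, mean_rest, count


def fake_stream(n_tokens: int, prefill_s: float, decode_s: float) -> Iterator[str]:
    """Hypothetical stand-in for a streaming generator (assumption, not a real API)."""
    time.sleep(prefill_s)  # simulated prefill over the long prompt
    yield "tok0"
    for i in range(1, n_tokens):
        time.sleep(decode_s)  # simulated per-token decode step
        yield f"tok{i}"


if __name__ == "__main__":
    first, rest, n = measure_token_latency(fake_stream(5, 0.05, 0.01))
    print(f"tokens={n} first={first * 1000:.1f} ms rest={rest * 1000:.1f} ms/token")
```

Reporting the two numbers separately helps because long summarization prompts mostly affect the first-token figure, while output length mostly affects the rest-token total.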

juan-OY · Jun 06 '24 13:06