ipex-llm
ipex-llm
copied to clipboard
Published
20 hours ago
•
intel-analytics
Reame
Issues
[Windows] Qwen1.5-7B 性能优化
Open
juan-OY
opened this issue 1 month ago
• 0 comments
运行过程中会对多个长输入回复总结,希望能在Qwen1.5-7B模型下进一步优化首字和rest token处理延时。
Jun 06 '24 13:06
juan-OY