Chinese-LLaMA-Alpaca
Chinese-LLaMA-Alpaca copied to clipboard
如何能让模型以stream方式输出问答?
合并lora权重后的模型,推理启动后,都是一次性输出回答,如何配置,或者修改代码,能够使得模型以一个字一个字的方式输出回答呢?
暂时不支持,可以看看huggingface是否有类似的实现
已经收到您的邮件,谢谢!
This issue has been automatically marked as stale because it has not had recent activity. It will be closed if no further activity occurs. Thank you for your consideration.
Closing the issue, since no updates observed. Feel free to re-open if you need any further assistance.