wangchao
wangchao
重新尝试,多等了一会,前端有文档已完成加载的提示了,asking一直是...的等待状态。  后端程序还在运行,但是有如下报错: The dtype of attention mask (torch.int64) is not bool
更新很快啊,赞,已用最新鲜版本跑了,24core cpu,推理等了挺久,然后报错: python3 ./webui.py Explicitly passing a `revision` is encouraged when loading a model with custom code to ensure no malicious code has been contributed in a newer revision. Explicitly...
I completely agree. The user experience is not satisfactory when the response is returned all at once. Is it possible to support a streaming approach where we can see the...