bltcn comments

Results 26 comments of


                                            bltcn

[Translation Task] Yi README: Introduction + Models + News

assign it to me - review

非常感谢，已经测试通过，另外，想请教一下，如果想通过fastapi将其作为服务端接口提供出去，讲模型载入内存后常驻，请问如何做呢，是参考[deploy](https://github.com/mindspore-lab/mindocr/tree/main/deploy)/[py_infer](https://github.com/mindspore-lab/mindocr/tree/main/deploy/py_infer)/[example](https://github.com/mindspore-lab/mindocr/tree/main/deploy/py_infer/example) /ocr_infer_server.py这个嘛？

TensorRT后端能否生成engin文件，下次启动直接打开engin文件，不用每次生成。

请教一下，您是怎么做的？

当连续请求200多次后，出现突然卡住的情况

同样的问题

[Bug]8卡2080ti无法启动qwen2-72b-insctruct

> May try to "--max-batch-size 1" If it doesn't work, you may go for vLLM. It will take a while to optimize memory in LMDeploy. Don't let it to block...

[Bug] 最新版本的ollamo不兼容【从0.1.28升级到0.1.30】

需要在启动ollama的服务时，增加参数OLLAMA_ORIGINS=*

bltcn

最新的fastdeploy镜像版本是多少，怎么下载？

模型效果很差，是什么原因呢？

[BUG/Help] int8的版本哪儿下载

[Translation Task] Yi README: Introduction + Models + News

关于华为计算中心的昇腾设备无法运行本项目

能否提供一个在910b上进行pp-ocr v4部署的样例？

TensorRT后端能否生成engin文件，下次启动直接打开engin文件，不用每次生成。

当连续请求200多次后，出现突然卡住的情况

[Bug]8卡2080ti无法启动qwen2-72b-insctruct

[Bug] 最新版本的ollamo不兼容【从0.1.28升级到0.1.30】