Fred Fan
Same question. Sometimes this problem occurs even without switching to another page, as shown in the image below. Container version: qanything:v1.0.8 (v1.1.1 is unusable for me and fails with "Triton Inference Error (error_code: 4)", so I switched back to this version).

[image]
Have you solved this issue yourself? I am hitting it on my host, which runs CentOS 8.5 with kernel version 4.18.0-348.el8.x86_64.
> dnf install -y elfutils-devel

This suggestion works! Thank you!
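Has anyone run into the git command hanging while rolling back the model? `git reset --hard 79b3da3bbb35406f0b2da3acfcdb4c96c2837faf`

> > 2024.1.30, A100: I also hit this error, Triton Inference Error (error_code: 4), with the latest code, version 1.1.1.
> >
> > I downloaded both the master code and the v1.1.1 code. I tried changing freeren/qanyxxx:v1.0.9 in docker-compose-xxx.yaml to freeren/qanyxxx:v1.0.8, 1.1.1, 1.1.0, and 1.0.7, and none of them worked. Either asking a question shows that an error occurred, or I get Triton Inference Error (error_code: 4), or it stays stuck at The triton service is starting up, it can be long......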
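Same question. I am using v1.1.1 on an A800 card with the Qwen-7B model, and the deployed results are noticeably worse than those on the official site. Could the maintainers advise on how to improve Q&A quality after deployment?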
Has there been any progress on this? Thanks.
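Has either of you made any progress on this? I have the same problem. In my case I am using a new third-party LLM, Yuan-2.0. Startup succeeds, but after that every question I ask produces this result. I suspect a compatibility problem, for example that the template QAnything provides for Yuan-2.0 is not suitable.

INFO:httpx:HTTP Request: POST http://localhost:7802/v1/chat/completions "HTTP/1.1 200 OK"
INFO:root:Error calling API: 'NoneType' object is not subscriptable

rerank_server.log suggests that the correct answer was retrieved:

> local rerank query: 韦小宝住在哪里
> local rerank passages number: 2
>...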
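
For anyone debugging the same log: `'NoneType' object is not subscriptable` is the message Python raises when code indexes into `None`, for example `payload["choices"][0]` when `choices` is `None`. Below is a minimal sketch of parsing a chat completion response defensively, assuming the LLM server returns OpenAI-style JSON and may send an error object with HTTP 200. The helper name `extract_answer` is hypothetical and this is not QAnything's actual code; whether this is the real cause here is only a guess.

```python
from typing import Any, Optional


def extract_answer(payload: Optional[dict]) -> str:
    """Return the assistant text from an OpenAI-style chat completion
    payload, or a readable error string instead of raising TypeError.

    Hypothetical helper for illustration; field names follow the
    OpenAI chat completion response shape.
    """
    if not isinstance(payload, dict):
        return "error: empty or non-JSON response"

    # Assumption: some OpenAI-compatible servers report failures as
    # {"object": "error", "message": "..."} rather than via HTTP status.
    if payload.get("object") == "error":
        return f"error: {payload.get('message', 'unknown')}"

    choices: Any = payload.get("choices")
    if not choices:
        # Covers both a missing key and an explicit None / empty list.
        return "error: response has no choices"

    first = choices[0] if isinstance(choices[0], dict) else {}
    message = first.get("message") or {}
    content = message.get("content")
    if content is None:
        return "error: first choice has no message content"
    return content
```

Logging the raw response body next to the returned string should show whether the model server sent an error object or an answer in an unexpected shape.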
A follow-up: calling the API directly returns content normally; only the web page shows the error:

```
curl --location 'http://47.93.62.12:1234/api/local_doc_qa/local_doc_chat' --data '{ "user_id": "zzp", "kb_ids": ["KB5b72ff22213b42a7a6a1a9683b08baa5"], "question": "韦小宝住在哪里", "history": [] }'
{"code":200,"msg":"success chat","question":"韦小宝住在哪里","response":"data: [DONE]\n\n","history":[["韦小宝住在哪里","Error code: 400 - {'object': 'error', 'message': '**NETWORK ERROR DUE TO HIGH...