John
Same issue here. Has it been resolved yet?
> Same issue. Were you able to load vicuna-13b on two GPUs? How did you finally resolve it?

On my side it runs on a single GPU, and it runs on two GPUs as well.
> A lazy way to solve this is to add a line in fastchat.serve.openai_api_server.py line 233 with `conv["messages"] = []` after `conv = await get_conv(model_name)`

This does not seem to solve the problem.
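For reference, a minimal sketch of what that suggested patch looks like, assuming the surrounding code matches the snippet quoted above (the exact line number varies across FastChat versions):

```python
# Excerpt from fastchat/serve/openai_api_server.py (around line 233 in the
# version discussed above; location differs across releases).
conv = await get_conv(model_name)
# The suggested "lazy fix": clear any cached conversation history so that
# previous turns do not count against the model's 2048-token context window.
conv["messages"] = []
```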
openai.error.APIError: Invalid response object from API: '{"object":"error","message":"This model\'s maximum context length is 2048 tokens. However, you requested 2228 tokens (1716 in the messages, 512 in the completion). Please reduce the...
Also, if the local knowledge base is the bundled samples, it works.
> > > When I start FastChat's vicuna-13b API service, configure it in config (the API was tested locally and returns results), and load the knowledge base (1000+ documents; inference works fine with chatGLM), the token limit is exceeded as soon as I ask a question, even just "hello";
> > > The error is: openai.error.APIError: Invalid response object from API: '{"object":"error","message":"This model's maximum context length is 2048 tokens. However, you requested 2359 tokens (1847 in the messages,...
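Since the retrieved knowledge-base chunks are what push the prompt past 2048 tokens even for a one-word question, one workaround is to trim the retrieved context before it reaches the API. Below is a minimal client-side sketch, not part of this project: the model id `lmsys/vicuna-13b-v1.5`, the 512-token completion budget taken from the error message, and the `fit_chunks` helper are all illustrative assumptions.

```python
from transformers import AutoTokenizer

MAX_CONTEXT = 2048    # vicuna-13b context window reported in the error
MAX_COMPLETION = 512  # completion budget the API server requested

# Assumed tokenizer; swap in whichever vicuna checkpoint you actually serve.
tokenizer = AutoTokenizer.from_pretrained("lmsys/vicuna-13b-v1.5")

def fit_chunks(question: str, chunks: list[str]) -> list[str]:
    """Keep only as many retrieved chunks (in relevance order) as fit
    within the prompt budget: context window minus completion minus question."""
    budget = MAX_CONTEXT - MAX_COMPLETION - len(tokenizer.encode(question))
    kept = []
    for chunk in chunks:
        n = len(tokenizer.encode(chunk))
        if n > budget:
            break  # stop at the first chunk that would overflow the budget
        kept.append(chunk)
        budget -= n
    return kept
```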
Note: this problem occurs in a K8s container but does not appear when running directly on a physical machine.