[Question]: How to improve parsing and chatting speed?

Open wjjf opened this issue 1 year ago • 3 comments

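[Question]: Hello, I have deployed an instance on a server, using the Qwen (千问) large model; the server is 16+200G. The problem is that document parsing sometimes gets stuck, and when I ask a question, an answer (if there is one) usually takes at least 20s. Which parameters do I need to adjust to improve this? Where is my performance bottleneck?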

wjjf avatar May 17 '24 01:05 wjjf

-- We use a vision model to parse PDFs, so parsing is slow. We're working on its speed. In the meantime, you can disable layout recognition for PDFs in the general parsing method. This can also be configured later in the knowledge base configuration.

-- As for chat speed, we're going to use streaming chat, which will make the chat experience feel faster. It will be released today in the Docker images of the dev version.

-- I'm not sure what the 16 refers to. If it's 16GB of memory, I'm afraid that's not enough; 32GB would be better.
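
To illustrate the streaming point above, here is a minimal sketch of why streamed output feels faster even when total generation time is unchanged. It does not use RAGFlow's API: `fake_token_stream` and `measure` are hypothetical names, and the per-token delay is an assumed stand-in for a real model.

```python
import time
from typing import Iterator, List, Tuple


def fake_token_stream(tokens: List[str], delay_s: float) -> Iterator[str]:
    """Hypothetical stand-in for an LLM that emits one token every delay_s seconds."""
    for token in tokens:
        time.sleep(delay_s)  # assumed per-token generation cost
        yield token


def measure(tokens: List[str], delay_s: float) -> Tuple[float, float]:
    """Return (time_to_first_token, total_time) in seconds for one streamed answer."""
    start = time.perf_counter()
    first = None
    for _ in fake_token_stream(tokens, delay_s):
        if first is None:
            # With streaming, the user starts reading at this point.
            first = time.perf_counter() - start
    total = time.perf_counter() - start
    return (first if first is not None else total), total


if __name__ == "__main__":
    answer = "streaming shows the first words long before the answer ends".split()
    ttft, total = measure(answer, delay_s=0.05)
    print(f"first token after {ttft:.2f}s, full answer after {total:.2f}s")
```

Without streaming, the user waits for the full answer time before seeing anything. With streaming, the wait is roughly the time to the first token, so this changes perceived latency rather than throughput.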

KevinHuSh avatar May 17 '24 01:05 KevinHuSh

Thanks!

wjjf avatar May 17 '24 02:05 wjjf

I'm now using a machine with 32GB of RAM, but answers still take twenty to thirty seconds. What parameters should I adjust to improve this?

wjjf avatar May 20 '24 08:05 wjjf