ragflow icon indicating copy to clipboard operation
ragflow copied to clipboard

[Question]: Retrieval is very slow, and time is mainly spent on generating answers and tune question.

Open lizhao-8202 opened this issue 10 months ago • 5 comments

Describe your problem

ragflow: v0.15.1-slim chat model: qwen2.5:14b embeding model: bge-m3

Retrieval too slowly,This is the time spent on each step

Image

I hope to optimize the response speed, but I don't know how to start. Can I adjust the docker images startup parameters? If possible, which components and parameters should be adjusted

lizhao-8202 avatar Feb 18 '25 07:02 lizhao-8202

the same problem for me. It's so frustrating! The retrieval speed is making ragflow unusable.

stevenguan08 avatar Feb 18 '25 08:02 stevenguan08

Do you use re-rank model here?

KevinHuSh avatar Feb 19 '25 04:02 KevinHuSh

the same problem for me. I'm also encountering the same problem. Sometimes the same problem is relatively fast, and sometimes it's very slow. May I ask if it has anything to do with the API I'm using: Currently,
chat api uses the online siliconflow.cn (deepseek_v3), embedding model api: siliconflow.cn (BAAI/bge-m3)

zhouyangit avatar Feb 19 '25 06:02 zhouyangit

Do you use re-rank model here?

Not useing re-rank model

lizhao-8202 avatar Feb 19 '25 07:02 lizhao-8202

How many chunks do you estimate it has?

KevinHuSh avatar Feb 19 '25 07:02 KevinHuSh

Same to me.

Total: 5999.8ms Check LLM: 6.8ms Create retriever: 2.2ms Bind embedding: 68.4ms Bind LLM: 70.3ms Tune question: 2462.6ms Bind reranker: 0.0ms Generate keyword: 0.0ms Retrieval: 1501.1ms Generate answer: 1888.5ms

I have two PDF documents that aren't large. Is there a way to reduce the Tune question time?

chat19 avatar Mar 27 '25 15:03 chat19

Image

Disable it. @chat19

KevinHuSh avatar Mar 28 '25 03:03 KevinHuSh

现在可以了。感谢

lizhao-8202 avatar Mar 31 '25 01:03 lizhao-8202