[Bug]: Keywords and questions should not be left out, especially the last dozen or so chunks.

Open lonrencn opened this issue 10 months ago • 1 comment

Is there an existing issue for the same bug?

  • [x] I have checked the existing issues.

RAGFlow workspace code commit ID

commit 5fdfb8d465826bf27bff61a6c5fb5f8340b0b5cc

RAGFlow image version

v0.16.0-177-g5fdfb8d4 slim

Other environment information

ubuntu 24
ollama qwen2.5:32b
bge-m3

Actual behavior

I have been testing this for several days. When I build a knowledge base from academic monographs, the step that assigns keywords and questions to chunks often skips the first 20 chunks: when I open the document and look at the first and second pages of chunks, they have no keywords or questions. In the parsing settings, everything is enabled except the knowledge graph.
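To make the observation above easier to verify, here is a minimal sketch that lists which chunk positions lack keywords or questions. It assumes the chunks have already been exported as a list of dictionaries; the field names `important_keywords` and `questions` and the helper `chunks_missing_metadata` are assumptions for illustration, not confirmed RAGFlow identifiers, so adjust them to whatever your export actually contains.

```python
from typing import Any, Mapping, Sequence


def chunks_missing_metadata(
    chunks: Sequence[Mapping[str, Any]],
    fields: Sequence[str] = ("important_keywords", "questions"),  # assumed field names
) -> list[int]:
    """Return the 0-based positions of chunks where any field is absent or empty."""
    missing = []
    for index, chunk in enumerate(chunks):
        # A field counts as missing if the key is absent, None, or an empty list/string.
        if any(not chunk.get(field) for field in fields):
            missing.append(index)
    return missing


if __name__ == "__main__":
    # Hypothetical export: only the second chunk has both keywords and questions.
    sample = [
        {"content": "chunk 0", "important_keywords": [], "questions": []},
        {"content": "chunk 1", "important_keywords": ["RAG"], "questions": ["What is RAG?"]},
        {"content": "chunk 2", "important_keywords": ["LLM"]},
    ]
    print(chunks_missing_metadata(sample))
```

Running this on the chunk list of an affected document would show whether the gaps cluster at the start of the document, at the end, or are scattered, which may help tell a position-dependent bug apart from intermittent LLM failures.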

Expected behavior

https://567.daoson.top:8443/#s/_U6PL34A ---> test document

Steps to reproduce

normal control

Additional information

No response

lonrencn, Mar 02 '25 02:03

I could not reproduce this. Could it be caused by unstable LLM invocation? How can it be reproduced? Which chunking method are you using?

KevinHuSh, Mar 03 '25 04:03