ragflow icon indicating copy to clipboard operation
ragflow copied to clipboard

[Bug]: When parsing a knowledge base document, all parsed historical documents will be flashed as error messages

Open super-noodle opened this issue 10 months ago • 1 comments

Self Checks

  • [x] I have searched for existing issues search for existing issues, including closed ones.
  • [x] I confirm that I am using English to submit this report (Language Policy).
  • [x] Non-english title submitions will be closed directly ( 非英文标题的提交将会被直接关闭 ) (Language Policy).
  • [x] Please do not modify this template :) and fill in all the required fields.

RAGFlow workspace code commit ID

null

RAGFlow image version

v.0.17.0

Other environment information

Ubuntu 
Vllm

Actual behavior

I have successfully parsed many documents, but during the most recent parsing process, there may have been an error in parsing one document, causing all previously parsed content to be flashed as error message content. But after that, the document can be parsed normally again. I am using the deepseek-r1-14b model, and the vector model uses the built-in bge-large-zh-v1.5. and all the error message is all the same.

We have a total of 198 documents to parse, all of which are docx versions, and the chosen parsing mode is laws. We have tried to parse all the documents at once, or parse every 4-5 articles as a batch, but we have encountered this problem in the end.

Image

How to solve the above problem?

Expected behavior

Can parse normally without any errors.

Steps to reproduce

We have a total of 198 documents to parse, all of which are docx versions, and the chosen parsing mode is laws.
After uploading the document, start parsing 4-5 documents and wait for the parsing to complete before viewing the parsed content.

Additional information

No response

super-noodle avatar Mar 12 '25 03:03 super-noodle

I guess you need to switch to another LLM for knowledge graph extractions.

KevinHuSh avatar Mar 12 '25 04:03 KevinHuSh