graphrag icon indicating copy to clipboard operation
graphrag copied to clipboard

[Bug]: graph_extractor miss entities while processing llm result

Open zqWu opened this issue 5 months ago • 0 comments

Do you need to file an issue?

  • [X] I have searched the existing issues and this bug is not already filed.
  • [ ] My model is hosted on OpenAI or Azure. If not, please look at the "model providers" issue and don't file a new one here.
  • [ ] I believe this is a legitimate bug, not just a question. If this is a question, please use the Discussions area.

Describe the bug

at workflow entity_extraction, after glean, the result from llm looks like

("entity"<|>避难层<|>建筑空间<|>建筑内用于人员暂时躲避火灾及其烟气危害的楼层)##
... other entities ...
<|COMPLETE|>("entity"<|>避难层<|>建筑空间<|>建筑内用于人员暂时躲避火灾及其烟气危害的楼层)##
... other entities ...
<|COMPLETE|>

after that, it will process the string to graph.

problem: it cannot handle
<|COMPLETE|>("entity"<|>避难层<|>建筑空间<|>建筑内用于人员暂时躲避火灾及其烟气危害的楼层)

suggest solution: pre_result + + glean_result or split result by complete_delimiter, then by record_delimiter

image

Steps to reproduce

No response

Expected Behavior

No response

GraphRAG Config Used

# Paste your config here

Logs and screenshots

No response

Additional Information

  • GraphRAG Version:
  • Operating System:
  • Python Version:
  • Related Issues:

zqWu avatar Sep 23 '24 04:09 zqWu