lovepeace-max issues

Results 6 issues of


                                            lovepeace-max

[Question]: 对于内嵌图片的pdf文件，pdf_parser没有处理图片的逻辑导致不能生成base_parser里extract_images_from_chunk函数所需要的img tag？

### 问题类别其他 ### 问题描述您好，我在看代码的时候，看到对于内嵌图片的pdf文件，pdf_parser有针对table的处理逻辑，但是没有针对图片处理逻辑，导致不能生成base_parser里extract_images_from_chunk函数所需要的*img* tag？ ### 背景信息 _No response_ ### 操作系统 macOS ### 其他环境信息 _No response_ ### 相关日志 ```shell ``` ### 已查找的资源 _No response_ ### 确认事项 - [x]...

question

[Question]: process_chunks_images函数中信号量固定为1的原因？

### 问题类别其他 ### 问题描述您好，我在看docreader的代码的时候，看到对于切分后的chunks，如果要对每一个chunk做图片的ocr识别，是通过process_chunks_images函数来执行的，但是函数中代码将并发量限制为1了，这么固定的原因是？如果机器配置比较好，这里是不是可以设置为更大的数？ ``` async def process_all_chunks(): # Set max concurrency, reduce concurrency to avoid resource contention max_concurrency = min(self.max_concurrent_tasks, 1) # Reduce concurrency # Use...

question

[BUG] Incomplete graph structure in Neo4j

## Bug Description I followed the official example in this [documentation](https://help.getzep.com/graphiti/getting-started/quick-start) and I installed Neo4j Desktop as graph database. But it seems to create incomplete graph structures as shown in...

bug

python

Feat: 接入Youtu-GraphRag

**#️⃣ 您的功能请求是否与某个问题相关？请描述。** 根据最新的[youtu-graphrag](https://github.com/TencentCloudADP/youtu-graphrag)论文来看，似乎比LighRAG效果要好很多，消耗的token数也少一些，不知道是否计划接入 **#️⃣ 描述您期望的解决方案** 增加youtu-graphrag

enhancement

Graph中存在同一实体重复多次(entity_node_id不一样)的问题

Hi, 在看代码的时候，发现有点问题: ``` def process_level1_level2(self, chunk: str, id: int): # Process attributes and triples attr_nodes, attr_edges = self._process_attributes(extracted_attr, id, entity_types) triple_nodes, triple_edges = self._process_triples(extracted_triples, id, entity_types) ``` 上面self._process_attributes和self._process_triples都会调用self._find_or_create_entity函数， ``` def...

agent/nonagent模式下图谱构建过程中的ETA都会不断增加

Hi, 在documents较多的时候，使用agent/nonagent两种模式时，图谱构建过程中的ETA都会线性增加，区别是agent耗时更多。不太理解nonagent模式为什么ETA也会不断增加，每个document的大小差别不大，除了schema，还会有什么在不断扩增导致耗时增加吗？多谢