PaddleOCR icon indicating copy to clipboard operation
PaddleOCR copied to clipboard

PPStructureV3处理多页PDF报错

Open zhangjun0506 opened this issue 3 months ago • 2 comments

🔎 Search before asking

  • [x] I have searched the PaddleOCR Docs and found no similar bug report.
  • [x] I have searched the PaddleOCR Issues and found no similar bug report.
  • [x] I have searched the PaddleOCR Discussions and found no similar bug report.

🐛 Bug (问题描述)

PPStructureV3处理多页PDF报错,当配置use_seal_recognition = True时候,处理多页PDF,如果有的页没有印章,则报错IndexError: list index out of range

🏃‍♂️ Environment (运行环境)

OS windows11 python 3.11 paddleOCR 3.2.0

🌰 Minimal Reproducible Example (最小可复现问题的Demo)

from paddleocr import PPStructureV3

pipeline = PPStructureV3(enable_mkldnn=True, use_seal_recognition = True,) output = pipeline.predict("./watched_folder/LSZD.pdf") for res in output: res.print() ## 打印预测的结构化输出 res.save_to_json(save_path="output") ## 保存当前图像的结构化json结果 res.save_to_markdown(save_path="output") ## 保存当前图像的markdown格式的结果

zhangjun0506 avatar Oct 17 '25 08:10 zhangjun0506

感谢您的反馈,我们将优先排查此问题。请稍候,我们会尽快提供更新。

changdazhou avatar Oct 17 '25 12:10 changdazhou

和 #16644 是同一个问题

yinkaisheng avatar Oct 21 '25 01:10 yinkaisheng