PaddleOCR
PaddleOCR copied to clipboard
PPStructureV3处理多页PDF报错
🔎 Search before asking
- [x] I have searched the PaddleOCR Docs and found no similar bug report.
- [x] I have searched the PaddleOCR Issues and found no similar bug report.
- [x] I have searched the PaddleOCR Discussions and found no similar bug report.
🐛 Bug (问题描述)
PPStructureV3处理多页PDF报错,当配置use_seal_recognition = True时候,处理多页PDF,如果有的页没有印章,则报错IndexError: list index out of range
🏃♂️ Environment (运行环境)
OS windows11 python 3.11 paddleOCR 3.2.0
🌰 Minimal Reproducible Example (最小可复现问题的Demo)
from paddleocr import PPStructureV3
pipeline = PPStructureV3(enable_mkldnn=True, use_seal_recognition = True,) output = pipeline.predict("./watched_folder/LSZD.pdf") for res in output: res.print() ## 打印预测的结构化输出 res.save_to_json(save_path="output") ## 保存当前图像的结构化json结果 res.save_to_markdown(save_path="output") ## 保存当前图像的markdown格式的结果
感谢您的反馈,我们将优先排查此问题。请稍候,我们会尽快提供更新。
和 #16644 是同一个问题