[Bug]: Incorrect parsing of PDF layout
Is there an existing issue for the same bug?
- [X] I have checked the existing issues.
Branch name
main
Commit ID
51efecf
Other environment information
No response
Actual behavior
Process a PDF that contains tables, or a PDF with a left-right (two-column) layout, using the "general" method.
Expected behavior
No response
Steps to reproduce
Process a PDF that contains tables, or a PDF with a left-right (two-column) layout, using the "general" method.
Additional information
No response
When using "general" with the layout capability enabled, layout recognition is often inaccurate.
Which version? Online demo or local deployment?
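Both the online demo and the local deployment have this problem. The version is 0.11.
Versions 0.9 through 0.11 all have similar problems. As a result, PDF data processed with layout enabled ends up misaligned or missing.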
Many factors can cause incorrect layout parsing. If possible, could you please attach the problematic file and a screenshot to this issue?
By the way, we intend to create an international community, so we encourage using English for communication.
Any updates?
I have the same issue:
original:
parsed:
It should be much better in the latest version.