feat(model inference): add table recognition and conversion to LaTeX

Open papayalove opened this issue 1 year ago • 1 comment

What's Changed

New Features

  • Add table content recognition, using the StructEqTable weights to convert table images to LaTeX.

Instructions

  • pip install pypandoc struct-eqtable==0.1.0
  • Download the StructEqTable weights and put them under the 'models/' directory.
  • Set the 'table-mode' value to enable table recognition, which is turned off by default.
  • If you have not downloaded any models before, refer to "how to download models".
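
The steps above can be checked before enabling 'table-mode' with a small preflight script. This is only a sketch, not part of the PR: the helper name `check_table_setup` is hypothetical, the import name `struct_eqtable` is an assumption about how the `struct-eqtable` package is exposed, and the check only confirms that 'models/' exists and is not empty, since the exact layout of the weights is not stated here.

```python
import importlib.util
from pathlib import Path


def check_table_setup(models_dir="models", modules=("pypandoc", "struct_eqtable")):
    """Return a list of problems found; an empty list means the setup looks complete.

    Hypothetical helper: the module names and the directory check are assumptions,
    not something this PR ships.
    """
    problems = []

    # Step 1: the packages installed with pip should be importable.
    for name in modules:
        if importlib.util.find_spec(name) is None:
            problems.append(f"missing Python package: {name}")

    # Step 2: the weights directory should exist and contain at least one entry.
    root = Path(models_dir)
    if not root.is_dir() or not any(root.iterdir()):
        problems.append(f"no weights found under '{root}/'")

    return problems


if __name__ == "__main__":
    issues = check_table_setup()
    print("setup looks complete" if not issues else "\n".join(issues))
```

An empty result only means the prerequisites are present; it does not verify that the downloaded weights are the right ones or that 'table-mode' has been set.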

papayalove, Aug 01 '24 08:08

CLA Assistant Lite bot:
Thank you for your submission, we really appreciate it. Like many open-source projects, we ask that you all sign our Contributor License Agreement before we can accept your contribution. You can sign the CLA by posting a pull request comment in the format below.


I have read the CLA Document and I hereby sign the CLA


0 out of 2 committers have signed the CLA.
:x: @liukaiwen
:x: @papayalove
liukaiwen does not seem to be a GitHub user. You need a GitHub account to be able to sign the CLA. If you already have a GitHub account, please add the email address used for this commit to your account.
You can retrigger this bot by commenting "recheck" in this pull request.

github-actions[bot], Aug 01 '24 08:08