docling icon indicating copy to clipboard operation
docling copied to clipboard

feat: translate equations to latex when running MSWord backend

Open rateixei opened this issue 11 months ago • 3 comments

At the moment, equations present in MSWord documents are not exported. This PR translates the MSWord equations to latex and includes it in the text output. New test files have been added to test this feature.

Checklist:

  • [ ] Documentation has been updated, if necessary.
  • [ ] Examples have been added, if necessary.
  • [ ] Tests have been added, if necessary.

rateixei avatar Jan 28 '25 14:01 rateixei

Merge Protections

Your pull request matches the following merge protections and will not be merged until they are valid.

🔴 Require two reviewer for test updates

This rule is failing.

When test data is updated, we require two reviewers

  • [ ] #approved-reviews-by >= 2

🟢 Enforce conventional commit

Wonderful, this rule succeeded.

Make sure that we follow https://www.conventionalcommits.org/en/v1.0.0/

  • [X] title ~= ^(fix|feat|docs|style|refactor|perf|test|build|ci|chore|revert)(?:\(.+\))?(!)?:

mergify[bot] avatar Jan 28 '25 14:01 mergify[bot]

@rateixei I see that the CI tests are failing, can you please re-generate the test GT? We need to see if it passes after that. Also, please rebase from main.

cau-git avatar Jan 31 '25 10:01 cau-git

Note: We will hold off with merging this until the design proposal for inline styles is implemented: https://github.com/DS4SD/docling/discussions/894

cau-git avatar Feb 07 '25 15:02 cau-git

@rateixei Now that we have the inline Formulas and Code, can you update this PR with using the right INLINE group?

PeterStaar-IBM avatar Feb 27 '25 13:02 PeterStaar-IBM