python-documentai-toolbox
python-documentai-toolbox copied to clipboard
fix: Add Backwards Compatibility for incorrect hOCR Format
- Fixes #193 - Adds
inline_words
parameter toexport_hocr_str()
to support previous (incorrect) format.
Let's hold this, pending some internal discussions. The only situation when we support and maintain multiple templates should be if they all conform to the hOCR spec, e.g. different spec version, or the spec allows some variations and there are multiple common and popular options by the users.
Closing out, should no longer be needed