unstructured
unstructured copied to clipboard
docx: include embedded images as Image elements
An author can embed one or more images in a Word document.
Extract those during partitioning and include them in the element stream as an Image element if the partition strategy is "hi_res".
@scanny - Have you done this for Word docs already or is that still in progress?
@MthwRobinson still WIP.
Added in the API a couple months back.