tikaondotnet icon indicating copy to clipboard operation
tikaondotnet copied to clipboard

Tika not extracting table with Content control fields from word document

Open mistakenjockey opened this issue 7 years ago • 2 comments

Hi, I have a word document which contains normal tables and there are some table with content control. Tika extract's the text of document and content of normal table perfectly but skip the table which has content control over it. How to extract the data from table with content control .

"Content controls are individual controls that you can add and customize for use in templates, forms, and documents. "

mistakenjockey avatar Oct 03 '18 07:10 mistakenjockey

Sorry you are having problems. That part of Tika (Office document extraction) is controlled by POI. I'd take a look over there to see if they support the desired capability.

KevM avatar Oct 03 '18 17:10 KevM

Thanks for the reply. keep posted if you find something which can resolve the issue.

mistakenjockey avatar Oct 04 '18 06:10 mistakenjockey