Kevin Miller
Kevin Miller
Not sure I follow what you are trying to do. Can you tell me more detail about what you need? There are examples of text extraction against many file types...
I don't believe that is an option that Tika offers. Tika is a high level abstraction. If you wanted to do that more precisely I would look at using POI...
You might be able to hook into Tika to get the raw markup. Not sure. Let me know what you find out.
Check out this issue #107
IMVM the basis of the library, as this is a java port, is not dotnet core compatible. On Fri, Feb 7, 2020 at 8:48 PM jfarbman wrote: > Not sure...
#152 Is tracking this
TBD: - [ ] `TikaOnDotnet.TextExtraction` should use nuspec or jave csproj properties to make the listing as nice as `TikaOnDotnet` - [ ] Move deployment automation to GitHub Actions.
Thank you!
There is one failing test for rtf files. No idea why it is not working. I was going to work on getting a pre-release out and then let people try...
So it sounds like there is still a benefit of our project doing once what people would otherwise need to do on every build. On Mon, Oct 24, 2022, at...