stanbol
stanbol copied to clipboard
Bump tika-version from 1.5 to 1.22 in /parent
Bumps tika-version
from 1.5 to 1.22.
Updates tika-core
from 1.5 to 1.22
Changelog
Sourced from tika-core's changelog.
Release 2.0.0 - ??? BREAKING CHANGES in 2.0.0
- Remove deprecated Metadata keys/properties (TIKA-1974).
Other changes
Release 1.23
Upgrade to POI 4.1.1 (TIKA-2851).
Upgrade to PDFBox 2.0.17 (TIKA-2951).
Ensure that the PDFParser respects custom configuration of Tesseract from tika-config.xml via Eric Pugh (TIKA-2970).
Add parser for XLIFF v1.2 files (TIKA-2975).
Add mime type detection support for WebAssembly (TIKA-2894).
Add an XLZ Parser (TIKA-2976).
Release 1.22 - 07/29/2019
... (truncated)
NOTE: Known regression: PDFBOX-4587 -- PDF passwords with codepoints between 0xF000 and 0XF0000 will cause an exception.
Add parser for HWP v5 files via SooMyung Lee (soomyung) and JinSup Kim (ddoleye) (TIKA-2909).
Fix order of closing streams to avoid "Failed to close temporary resource" exception (TIKA-2908).
Improve AutoDetectReader performance by caching encoding detector (TIKA-1568).
Prevent RTFParser from outputting illegal tag combinations (TIKA-2889).
Fix RereadableInputStream to release all resources (TIKA-2903).
Implement custom language identifier in the tika-eval module based on OpenNLP's language detector; add 18 languages and add common words lists for all 121 languages (TIKA-2790).
Fix NPE in MimeTypesReader.releaseParser() via Eamonn Saunders (TIKA-2896).
Fix RTFParser to extract more content (TIKA-2883).
Add clientSubmitTime to the metadata extracted from PST files (TIKA-2898).
Commits
-
aa2a385
[maven-release-plugin] prepare release 1.22-rc4 -
de0fca9
roll back for rc#4...update date -
4db132e
roll back for rc#4 -
c5daaf4
Merge remote-tracking branch 'origin/branch_1x' into branch_1x -
357c163
include opennlp lang model in tika-eval during assembly -
0f3790e
[maven-release-plugin] prepare for next development iteration -
c23f47e
[maven-release-plugin] prepare release 1.23-rc3 -
c25b81d
Merge remote-tracking branch 'origin/branch_1x' into branch_1x -
fd40040
roll back for rc#3, again... -
950ee35
[maven-release-plugin] prepare for next development iteration - Additional commits viewable in compare view
Updates tika-parsers
from 1.5 to 1.22
Changelog
Sourced from tika-parsers's changelog.
Release 2.0.0 - ??? BREAKING CHANGES in 2.0.0
- Remove deprecated Metadata keys/properties (TIKA-1974).
Other changes
Release 1.23
Upgrade to POI 4.1.1 (TIKA-2851).
Upgrade to PDFBox 2.0.17 (TIKA-2951).
Ensure that the PDFParser respects custom configuration of Tesseract from tika-config.xml via Eric Pugh (TIKA-2970).
Add parser for XLIFF v1.2 files (TIKA-2975).
Add mime type detection support for WebAssembly (TIKA-2894).
Add an XLZ Parser (TIKA-2976).
Release 1.22 - 07/29/2019
... (truncated)
NOTE: Known regression: PDFBOX-4587 -- PDF passwords with codepoints between 0xF000 and 0XF0000 will cause an exception.
Add parser for HWP v5 files via SooMyung Lee (soomyung) and JinSup Kim (ddoleye) (TIKA-2909).
Fix order of closing streams to avoid "Failed to close temporary resource" exception (TIKA-2908).
Improve AutoDetectReader performance by caching encoding detector (TIKA-1568).
Prevent RTFParser from outputting illegal tag combinations (TIKA-2889).
Fix RereadableInputStream to release all resources (TIKA-2903).
Implement custom language identifier in the tika-eval module based on OpenNLP's language detector; add 18 languages and add common words lists for all 121 languages (TIKA-2790).
Fix NPE in MimeTypesReader.releaseParser() via Eamonn Saunders (TIKA-2896).
Fix RTFParser to extract more content (TIKA-2883).
Add clientSubmitTime to the metadata extracted from PST files (TIKA-2898).
Commits
-
aa2a385
[maven-release-plugin] prepare release 1.22-rc4 -
de0fca9
roll back for rc#4...update date -
4db132e
roll back for rc#4 -
c5daaf4
Merge remote-tracking branch 'origin/branch_1x' into branch_1x -
357c163
include opennlp lang model in tika-eval during assembly -
0f3790e
[maven-release-plugin] prepare for next development iteration -
c23f47e
[maven-release-plugin] prepare release 1.23-rc3 -
c25b81d
Merge remote-tracking branch 'origin/branch_1x' into branch_1x -
fd40040
roll back for rc#3, again... -
950ee35
[maven-release-plugin] prepare for next development iteration - Additional commits viewable in compare view
Dependabot will resolve any conflicts with this PR as long as you don't alter it yourself. You can also trigger a rebase manually by commenting @dependabot rebase
.
Dependabot commands and options
You can trigger Dependabot actions by commenting on this PR:
-
@dependabot rebase
will rebase this PR -
@dependabot recreate
will recreate this PR, overwriting any edits that have been made to it -
@dependabot merge
will merge this PR after your CI passes on it -
@dependabot squash and merge
will squash and merge this PR after your CI passes on it -
@dependabot cancel merge
will cancel a previously requested merge and block automerging -
@dependabot reopen
will reopen this PR if it is closed -
@dependabot ignore this [patch|minor|major] version
will close this PR and stop Dependabot creating any more for this minor/major version (unless you reopen the PR or upgrade to it yourself) -
@dependabot ignore this dependency
will close this PR and stop Dependabot creating any more for this dependency (unless you reopen the PR or upgrade to it yourself) -
@dependabot use these labels
will set the current labels as the default for future PRs for this repo and language -
@dependabot use these reviewers
will set the current reviewers as the default for future PRs for this repo and language -
@dependabot use these assignees
will set the current assignees as the default for future PRs for this repo and language -
@dependabot use this milestone
will set the current milestone as the default for future PRs for this repo and language
You can disable automated security fix PRs for this repo from the Security Alerts page.