dkpro-core
Collection of software components for natural language processing (NLP) based on the Apache UIMA framework.
Move bibliography references from JavaDoc into OMTD-SHARE annotations (?) and pick them up from there as well when generating the reference documentation (?)
Handle discontinuous segments in brat format. For the moment, at least ignore them/warn.
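As a rough illustration of the "ignore them/warn" fallback, here is a minimal sketch. It assumes the brat standoff convention in which a text-bound annotation line has three tab-separated fields and a discontinuous annotation lists several `begin end` pairs separated by `;` (for example `T2	Location 0 5;16 23	North America`). The class `BratSpanSketch` and its methods are hypothetical names used only for this example; they are not part of the DKPro Core brat reader.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical helper, not an existing DKPro Core class.
class BratSpanSketch {
    /**
     * Returns the [begin, end] fragments of a text-bound annotation line,
     * e.g. "T2\tLocation 0 5;16 23\tNorth America" yields two fragments.
     */
    static List<int[]> parseFragments(String line) {
        String[] fields = line.split("\t");
        // fields[1] is assumed to look like "<Type> <begin> <end>[;<begin> <end>]..."
        String typeAndOffsets = fields[1];
        String offsets = typeAndOffsets.substring(typeAndOffsets.indexOf(' ') + 1);
        List<int[]> fragments = new ArrayList<>();
        for (String fragment : offsets.split(";")) {
            String[] parts = fragment.trim().split(" ");
            fragments.add(new int[] { Integer.parseInt(parts[0]), Integer.parseInt(parts[1]) });
        }
        return fragments;
    }

    /**
     * Keeps all lines except discontinuous text-bound annotations.
     * For each dropped annotation, a message is appended to warnings.
     */
    static List<String> keepContinuous(List<String> lines, List<String> warnings) {
        List<String> kept = new ArrayList<>();
        for (String line : lines) {
            // Only text-bound annotations (IDs starting with "T") carry offsets.
            if (line.startsWith("T") && parseFragments(line).size() > 1) {
                warnings.add("Ignoring discontinuous annotation: " + line.split("\t")[0]);
            } else {
                kept.add(line);
            }
        }
        return kept;
    }
}
```

Note that this sketch drops only the text-bound annotation itself; relations or events that refer to a dropped ID would still need to be skipped or reported separately.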
Currently, only a few of the UDPipe models are available as Maven packages. Package the remaining ones as well.
- [x] Process DKPro2Gate
- [ ] Process Gate2DKPro
- [ ] drop `id` and base all actions on `groupId/datasetId/version/language/mediaType`
- [ ] migrate UD dataset to `DatasetFactory` - problem here is that the DS is very large and...
- [ ] add mapping support for dependencies
- [ ] record tagset

See also:
- Issue #606
Enhance build.xml files to include a LICENSE.txt file in the packaged model JARs
Cf. #1029

**StanfordSegmenter**
- [X] handle `ALWAYS`
- [ ] handle `TWO_CONSECUTIVE`

**CoreNlpSegmenter**
- [ ] handle `ALWAYS`
- [ ] handle `TWO_CONSECUTIVE`
**Describe the refactoring action**

Update dependencies.
- versions-maven-plugin 2.16.2 -> 2.20.1
- gmavenplus-plugin 3.0.2 -> 4.2.1
- license-maven-plugin 4.4 -> 5.0.0
- maven-checkstyle-plugin 3.3.1 -> 3.6.0
- checkstyle 10.17.0 ->...
Bumps [org.apache.tika:tika-core](https://github.com/apache/tika) from 2.9.2 to 3.2.2.

Changelog

Sourced from org.apache.tika:tika-core's changelog.

Release 4.0.0-BETA1 - ???

BREAKING CHANGES

Moved towards default json based configuration (TIKA-4544 and many others). tika-pipes implementation modules...