dkpro-core
dkpro-core copied to clipboard
Collection of software components for natural language processing (NLP) based on the Apache UIMA framework.
**Describe the refactoring action** - ivy 2.5.1 -> 2.5.2 - jtok 2.1.19 -> 2.1.21 - spring 5.3.27 -> 5.3.30 - webanno-tsv code from INCEpTION 29.x - junrar 7.5.4 -> 7.5.5...
``` The build.xml for the treetagger module is rather complex , special, and not up-to-date with current standards: - it still uses macros 0.4.0 - it does not separate into...
Refactor the `BratReader` and `BratWriter` such that the main conversion code is extracted into plain Java classes which can be used without having to run a pipeline (cf. `DKPro2Lif` etc.)....
Some components (see e.g. https://github.com/dkpro/dkpro-core/blob/de.tudarmstadt.ukp.dkpro.core-1.9.2/dkpro-core-io-cermine-gpl/pom.xml) appear to have `SNAPSHOT` versions even within DKPro releases (at least in 1.9.0, 1.9.1 and 1.9.2). This leads to problems with gitflow which rejects SNAPSHOTS...
``` The organization in the textnormalizer module is slightly chaotic... trying to improve this. ``` Original issue reported on code.google.com by `richard.eckart` on 2014-12-04 20:56:15
``` Currently, configure() from ResourceObjtectProviderBase throws only a IOException()... it should throw more fine-grained exceptions, e.g. because a mapping was not found, because a resource was not found, because a...
``` Clean up textnormalizer. There's lots of odd things including printing to stanrdard output, swallowing exceptions, dumping stacktraces, etc... ``` Original issue reported on code.google.com by `richard.eckart` on 2014-07-28 20:55:59
``` The Sonatype OSS Maven Repository to which we delpoy now requires all project POMs to contain certain elements: https://docs.sonatype.org/display/Repository/Sonatype+OSS+Maven+Repository+Usage+Guide#SonatypeOSSMavenRepositoryUsageGuide-6.CentralSyncRequirement DKPro Core is currently missing and . These elements should...
``` TGrep2 binaries should be in separate Maven artifact. They should also not be in SVN. Need to create a build.xml and put the binaries somewhere the build.xml can pick...
With UIMAv3, the CasDumpWriter doesn't make too much sense anymore. The new toString() method format truncates the output after a certain indentation causing information to be lacking from the dump....