dkpro-core
dkpro-core copied to clipboard
Collection of software components for natural language processing (NLP) based on the Apache UIMA framework.
**Describe the refactoring action** Remove NLP4J module. **Expected benefit** Less code to maintain.
**Describe the refactoring action** Drop ClearNLP module. **Expected benefit** Less code to maintain.
This looks like a very nice project with lots of effort put into it. I really appreciate that. However the documentation is severely lacking. There is no usable Getting Started...
Refactor conversions into more reusable/uniform units. E.g. have classes called "X2Y" with a method "convert(source, target)".
Sometimes it is difficult/infeasible to limit number of output CAS by only using include/exclude patterns. Such as a folder with many similarly named files, or zip files, or when each...
Some time ago I had a discussion with @reckart about a project I am working on where we need paragraph- or even sentence-level language annotations in our document. Then the...
Add support for document-level key-value metadata. I imagine something like this: ``` === Variant 1 MetaDataEntry extends Annotation { String: key String: value } // Simplest option only allowing String...
Better support for CoNLL-U v2: - [ ] document boundary (`# newdoc`) - [ ] document ID (`# newdoc id = mf920901-001`) - [ ] paragraph boundary at sentence beginning...
It would be nice if DKPro would support a score feature of some sorts. The tools then can store the prediction probabillity in this field. It would be useful for...
Available here: http://macss.dfki.de/data/dependency_parser/UD_German_Clinical_retrain_250_0.gz