Andriy Mulyar issues

Results 28 issues of


Andriy Mulyar

Integrate MetamapLite as a component

[MetaMapLite](https://academic.oup.com/jamia/article/24/4/841/2961848) looks promising - currently MetaMap is very bulk. This would be a fascinating direction to explore and would make the package much more robust.

enhancement

[FEATURE REQUEST] Functionality for analyzing the differences between two Annotation objects.

**What problem does your feature solve?** A method to do analysis of annotations (namely for the application of looking at differences between gold and predicted annotations). **Describe the solution you'd...

enhancement

Refactor to directly use JSON output from Metamap2016

The newest version of [Metamap now supports JSON format](https://metamap.nlm.nih.gov/Docs/FAQ/JSON.pdf) - update the [Metamap](https://github.com/NanoNLP/medaCy/blob/master/medacy/pipeline_components/metamap/metamap.py) wrapper to directly parse this information. It currently gathers the XML and manually turns it into JSON...

enhancement

Requires Code Refactoring

Create flow diagrams and figures describing how the medaCy works.

enhancement

good first issue

Create a docker container with medaCy set up for easy first use

[FEATURE REQUEST] Option for label ranking in outputs

**Description** A CRF produces label probability outputs. Currently, we are simply using the highest probability label as the predicted entity label. It would be useful to allow for an option...

enhancement

Requires ML Background

Implement code for various feature representations

Currently only feature dictionaries exist - a necessity is the implementation of feature vectors. The feature type returned should be an argument to the FeatureExtractor class.

enhancement

Requires ML Background

High Priority

Andriy Mulyar

Integrate MetamapLite as a component

[FEATURE REQUEST] Functionality for analyzing the differences between two Annotation objects.

Refactor to directly use JSON output from Metamap2016

Create flow diagrams and figures describing how the medaCy works.

Create a docker container with medaCy set up for easy first use

[FEATURE REQUEST] Option for label ranking in outputs

Implement code for various feature representations

Refactor away UnitAnnotator and transition/test individual annotators for each unit type

Make token merging optional during token annotation in each PipelineComponent.

Systematically Adding Tasks to Benchmarks in a Study