bigbang icon indicating copy to clipboard operation
bigbang copied to clipboard

Adding entity recognition and example notebook

Open effyli opened this issue 2 years ago • 3 comments

Adding a named entity recognition method with huggingface. Adding an example notebook on how to use the recognizer with emails and visualize the recognized entities.

effyli avatar Apr 05 '22 12:04 effyli

Thanks for this PR!

The failing test seems to be unrelated -- it's due to deprecation in some dependencies: https://github.com/scverse/anndata/commit/7b1077d5e1a63fef0d47d716d19ea14ad2ad10de

Once I've gotten this test fixed I'll merge this PR.

sbenthall avatar Apr 07 '22 17:04 sbenthall

Does this code have any additional dependencies not yet included in requirements.txt?

sbenthall avatar Apr 07 '22 20:04 sbenthall

import transformers[torch]
import contractions
import spacy

sbenthall avatar Apr 08 '22 13:04 sbenthall

As this is still in an early experimental stage, I would opt for not merging it for now and wait until this work has more matured.

Christovis avatar Aug 31 '22 08:08 Christovis