Fix/dates v2/symptom extraction addition

Open mariellederocher opened this issue 8 months ago • 1 comments

Contribution from Marielle Derocher and Zohreh Mahdavi for the Deep Learning for Healthcare class.

Our goal was to create a method for training a model to extract symptoms from MIMIC-III clinical notes.

We added two files:

pyhealth/tasks/symptom_extraction.py

The SymptomExtraction task trains a given token classification model, such as Bio_ClinicalBERT, to identify symptoms in clinical notes.

symptom_extraction_mimic.ipynb

This notebook gives an example of how to use the NOTEEVENTS data from MIMIC-III, identify and tokenize the symptoms using scispaCy and UMLS data, and use the SymptomExtraction task.
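For readers unfamiliar with how symptom mentions become training targets for a token classification model, here is a minimal sketch of BIO labeling. It is not the code from the notebook: the regex tokenizer, the `bio_labels` helper, the `SYMPTOM` label name, and the example sentence are all assumptions made for illustration, and in the actual pipeline the character spans would come from scispaCy/UMLS entity linking rather than being computed by hand.

```python
import re


def tokenize_with_offsets(text):
    """Split text into word and punctuation tokens with character offsets."""
    return [(m.group(), m.start(), m.end())
            for m in re.finditer(r"\w+|[^\w\s]", text)]


def bio_labels(text, spans, label="SYMPTOM"):
    """Assign B-/I-/O tags to tokens given (start, end) character spans.

    `spans` is a hypothetical stand-in for the symptom mentions an
    entity linker would return; end offsets are exclusive.
    """
    tokens = tokenize_with_offsets(text)
    labels = ["O"] * len(tokens)
    for span_start, span_end in spans:
        inside = False
        for i, (_, tok_start, tok_end) in enumerate(tokens):
            # A token belongs to the span if it lies fully inside it.
            if tok_start >= span_start and tok_end <= span_end:
                labels[i] = ("I-" if inside else "B-") + label
                inside = True
    return [tok for tok, _, _ in tokens], labels


# Made-up sentence, not taken from MIMIC-III.
note = "Patient reports chest pain and nausea."
mentions = ["chest pain", "nausea"]
spans = [(note.index(m), note.index(m) + len(m)) for m in mentions]

tokens, labels = bio_labels(note, spans)
for tok, lab in zip(tokens, labels):
    print(f"{tok}\t{lab}")
```

A model such as Bio_ClinicalBERT uses its own subword tokenizer, so in practice these word-level labels would still need to be aligned to subword tokens before training.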

mariellederocher avatar May 08 '25 03:05 mariellederocher

Hey Ammara and Marielle, super cool work!

I think there's something we at PyHealth have done a bad job of explaining, specifically in our documentation and tutorials (tbh, that's on me). If you're interested in expanding your PyHealth PR or modifying it so we can merge it, I'd love it if you guys could do the following:

  • It seems the symptom extraction "task" is actually a model. If you guys could move it into the models directory and have it inherit from base_model, that would be really cool for us.
  • Let me know if you need help with anything. I'd highly recommend looking at our RNN implementation for it.
  • Thanks for being one of the groups that was really easy to grade this semester. I hope it hasn't been too bad of an experience at DL4H.

jhnwu3 avatar May 13 '25 18:05 jhnwu3

Closing this PR as it lacks proper labeling. Please add appropriate labels and reopen if needed.

jhnwu3 avatar Aug 04 '25 01:08 jhnwu3