MedCATtutorials icon indicating copy to clipboard operation
MedCATtutorials copied to clipboard

Questions About Example Part_3_2_Extracting_Diseases_from_Electronic_Health_Records.ipynb

Open JBarsotti opened this issue 8 months ago • 5 comments

This is an amazing module. Thanks for all your hard work.

I was working through the notebook notebooks/introductory/Part_3_2_Extracting_Diseases_from_Electronic_Health_Records.ipynb, and I have a couple of questions:

  1. Why do we need to retrain the modelpack on our own personal? I've tried it without retraining, and it seems to work okay, still. Am I missing something?
  2. I have access to the entire UMLS database. I tried to use that as my medpack model, but it doesn't seem to work with the code in notebooks/introductory/Part_3_2_Extracting_Diseases_from_Electronic_Health_Records.ipynb. Even on the simple example "This patient suffers from diabetes," it isn't able to recognize diabetes as an entity. When I run it on large clinical notes, it does not return a lot of CUIs that map to preferred names. They are just listed as "Unknown." Any ideas?

Thanks for an awesome module! It really is great.

JBarsotti avatar Jun 21 '24 06:06 JBarsotti