
emar event_txt " in Other Location" entries

Open justinrporter opened this issue 1 year ago • 1 comments

Prerequisites

  • [x] Put an X between the brackets on this line if you have done all of the following:
    • Checked the online documentation: https://mimic.mit.edu/
    • Checked that your issue isn't already addressed: https://github.com/MIT-LCP/mimic-code/issues?utf8=%E2%9C%93&q=

Description

In MIMIC-IV v1.0 emar.csv, there are a few (37520) event_txt entries whose value is exactly " in Other Location" (with a leading space). I expected to see values like "{Administered,Applied,Started,Flushed} in Other Location" and similar, but there seems to be a subgroup where the action part is missing, leaving a mangled (?) entry that is difficult to interpret.

In the absence of an obvious interpretation for " in Other Location" my best guess is that this is some kind of ETL bug?

Cheers and thanks for the great resource!

justinrporter, Jul 22 '22 19:07

Looking at emar in v1, I see a number of other rows that contain “in Other Location”:

MariaDB [mimic4v10]> select event_txt, count(*) n from emar where event_txt like "%in Other Location" group by event_txt order by n desc;
+--------------------------------------------------+-------+
| event_txt                                        | n     |
+--------------------------------------------------+-------+
| in Other Location                                | 37520 |
| Stopped in Other Location                        | 25226 |
| Administered in Other Location                   | 24108 |
| Flushed in Other Location                        |  6313 |
| Started in Other Location                        |  4475 |
| Confirmed in Other Location                      |  1632 |
| Applied in Other Location                        |  1241 |
| Stopped - Unscheduled in Other Location          |   795 |
| Assessed in Other Location                       |   481 |
| Removed in Other Location                        |   168 |
| Removed Existing / Applied New in Other Location |    59 |
| Restarted in Other Location                      |     7 |
| Rate Change in Other Location                    |     6 |
| Read in Other Location                           |     3 |
| Not Given per Sliding Scale in Other Location    |     3 |
| Infusion Reconciliation in Other Location        |     1 |
+--------------------------------------------------+-------+
16 rows in set (7.456 sec)

(For me, the first one does not start with a blank space because my load script applies trim() to the data.) Someone needs to figure out what all these "in Other Location" terms mean.
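To keep the bare entry separate from the ones that carry an action, here is a minimal sketch of how the values could be split, whether or not the load script trimmed them. It uses Python's built-in sqlite3 with a few made-up rows, not real MIMIC-IV data; the helper name `split_event_txt` and the toy table are my own assumptions, and the sketch says nothing about what "in Other Location" actually means.

```python
import sqlite3

SUFFIX = "in Other Location"


def split_event_txt(event_txt):
    """Split an event_txt value into (action, in_other_location).

    Whitespace is stripped first, so the raw CSV value " in Other Location"
    and its trimmed form are handled identically. The action is None when
    nothing precedes the suffix (or when the value is empty).
    """
    text = (event_txt or "").strip()
    if text.endswith(SUFFIX):
        action = text[: -len(SUFFIX)].strip()
        return (action or None, True)
    return (text or None, False)


# Toy rows for illustration only (hypothetical, not taken from MIMIC-IV).
rows = [
    (" in Other Location",),
    ("Administered in Other Location",),
    ("Stopped - Unscheduled in Other Location",),
    ("Administered",),
]

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE emar (event_txt TEXT)")
conn.executemany("INSERT INTO emar VALUES (?)", rows)

# Count the entries with no action at all, independent of leading spaces.
missing_action = conn.execute(
    "SELECT COUNT(*) FROM emar WHERE TRIM(event_txt) = ?", (SUFFIX,)
).fetchone()[0]

# Apply the Python helper to every row to separate action and location flag.
parsed = [split_event_txt(txt) for (txt,) in conn.execute("SELECT event_txt FROM emar")]
```

Comparing on `TRIM(event_txt)` means the same count comes out for an untrimmed load and for a trimmed one like mine.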

BTW, in v2, the same entries occur, but with slightly different counts:

MySQL [mimic4v20]> select event_txt, count(*) n from emar where event_txt like "%in Other Location" group by event_txt order by n desc;
+--------------------------------------------------+-------+
| event_txt                                        | n     |
+--------------------------------------------------+-------+
| in Other Location                                | 36469 |
| Stopped in Other Location                        | 25916 |
| Administered in Other Location                   | 24209 |
| Flushed in Other Location                        |  6347 |
| Started in Other Location                        |  4588 |
| Confirmed in Other Location                      |  1655 |
| Applied in Other Location                        |  1272 |
| Stopped - Unscheduled in Other Location          |   795 |
| Assessed in Other Location                       |   486 |
| Removed in Other Location                        |   168 |
| Removed Existing / Applied New in Other Location |    61 |
| Restarted in Other Location                      |     7 |
| Rate Change in Other Location                    |     6 |
| Read in Other Location                           |     3 |
| Not Given per Sliding Scale in Other Location    |     3 |
| Infusion Reconciliation in Other Location        |     1 |
+--------------------------------------------------+-------+
16 rows in set (18.331 sec)


pszolovits, Jul 22 '22 19:07