mimic3-benchmarks
mimic3-benchmarks copied to clipboard
Add commandline args for outlier detection, rescaling
Make outlier detection and input rescaling optional based on command-line args
I think this is done, right @Harhro94?
No, it isn't.
Before doing this we should resolve the inconsistencies in the column names of item_id_to_variable_map.csv
and variable_ranges.csv
files (reported in #28). Right now this inconsistency doesn't affect the code, since we don't do resaling and outlier detection.
Right, I'll take a look this week.
Hi @turambar @hrayrhar,
Greetings. Thank you for the work you have done to create a benchmark dataset and tasks!
Is there any update on this issue to remove outliers?
I am using the dataset generated in this repository for my research. I noticed that for some variables, i.e: weight (box plot below), the range of values is large and the box plot indicates outliers. I think these values are adversely affecting the machine learning model that I am researching. Hence, I am looking at ways to correct these outliers.