clinical-fusion
Confusion about the process of reproducing the prediction
I have already extracted the data and placed the following files in the data folder:
adm_details.csv pivoted_lab.csv pivoted_vital.csv
and I suppose I need to run the following code:
$ python 00_define_cohort.py # define patient cohort and collect labels
$ python 01_get_signals.py # extract temporal signals (vital signs and laboratory tests)
$ python 02_extract_notes.py --firstday # extract first day clinical notes
$ python 03_merge_ids.py # merge admission IDs
$ python 04_statistics.py # run statistics
$ python 05_preprocess.py # run preprocessing
$ python 06_doc2vec.py --phase train # train doc2vec model
$ python 06_doc2vec.py --phase infer # infer doc2vec vectors
However, when I get to python 06_doc2vec.py --phase train
it always fails with RuntimeError: you must first build vocabulary before training the model
Is there a step I missed that would cause this error?
Thank you so much for your help!
I think every step before this one runs correctly, and I have tried changing min_count in the model to 1, but I still get the same error. Can you help me with this? Thank you!
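As far as I understand, gensim raises this error when the vocabulary ends up empty, so one thing that could be checked is whether the notes passed to doc2vec contain any text at all. Below is a minimal sketch of such a check. The function name `summarize_corpus` and the sample notes are hypothetical and are not part of the repository; the real input would be whatever 02_extract_notes.py wrote out.

```python
from collections import Counter


def summarize_corpus(texts, min_count=1):
    """Return (non-empty document count, number of words kept under min_count).

    Hypothetical helper: it only mimics the idea of a min_count filter with a
    plain whitespace tokenizer, it does not call gensim.
    """
    word_freq = Counter()
    n_docs = 0
    for text in texts:
        if text is None:
            continue  # skip missing notes
        tokens = str(text).lower().split()
        if tokens:
            n_docs += 1
            word_freq.update(tokens)
    vocab_size = sum(1 for count in word_freq.values() if count >= min_count)
    return n_docs, vocab_size


# Stand-in data; replace with the notes actually read from the data folder.
sample_notes = ["patient admitted with chest pain", "", "   ", None]
n_docs, vocab_size = summarize_corpus(sample_notes, min_count=1)
print(f"non-empty documents: {n_docs}, vocabulary size: {vocab_size}")
```

If both numbers come out as 0 on the real notes, the problem would be in the note extraction step rather than in the doc2vec settings, which would also explain why changing min_count makes no difference.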