clinical-fusion icon indicating copy to clipboard operation
clinical-fusion copied to clipboard

Confusion for the process of redo the prediction

Open CalendulaED opened this issue 3 years ago • 1 comments

I already have the data extract and place them in the data folder includes: adm_details.csv pivoted_lab.csv pivoted_vital.csv and I suppose I need to run the following code: $ python 00_define_cohort.py # define patient cohort and collect labels $ python 01_get_signals.py # extract temporal signals (vital signs and laboratory tests) $ python 02_extract_notes.py --firstday # extract first day clinical notes $ python 03_merge_ids.py # merge admission IDs $ python 04_statistics.py # run statistics $ python 05_preprocess.py # run preprocessing $ python 06_doc2vec.py --phase train # train doc2vec model $ python 06_doc2vec.py --phase infer # infer doc2vec vectors

however, when it comes to python 06_doc2vec.py --phase train it always shows the error of RuntimeError: you must first build vocabulary before training the model

Are there any steps that I miss such that it will cause this error?

Thank you so much for your help!

CalendulaED avatar Jul 06 '21 03:07 CalendulaED

I think everything before this step is running correctly, and I have tried to change the min_count in the model to 1. But I still got the same error. Can you help me with this error? Thank you!

CalendulaED avatar Jul 06 '21 13:07 CalendulaED