Adapting my dataset to run the code

Open gabrielbianchin opened this issue 3 years ago • 1 comments

Hi, thanks for making your code accessible. I'm trying to adapt my dataset to be able to run with your code. It is not clear to me some files in our data, like how to generate {on}_go_1.pickle, {on}_label_regular_1.npy and {on}_label_matrix_1_sparse.npy. Also, It seems that some files (../Swiss_Prot/seq_{on}.pkl and ../ontology/{on}_go_trunc.pickle) are not available in the repository. Could you help me, please?

May 24 '22 23:05 gabrielbianchin

Hi @gabrielbianchin: Thank you for the interest in our code. The first author Yue would be a better candidate to answer the question, but before he does, my two cents are {on}_label_regular_1.npy and {on}_label_matrix_1_sparse.npy seem to be produced using https://github.com/Shen-Lab/TALE/blob/master/data/ours/convert.py However, that script needs to read some input data that were missing, as you and https://github.com/Shen-Lab/TALE/issues/3 had pointed out.

Hi @YueCao2017 @YueCao94 : We would greatly appreciate your input to address the missing data issue.

May 26 '22 04:05 Shen-Lab