mHiC
Step 4 breaks with underscore in chromosome names.
My chromosome names contain underscores, and the chrList variable is set to "LtaP_01 LtaP_02 LtaP_03 LtaP_04 LtaP_05 LtaP_06 LtaP_07 LtaP_08 LtaP_09 LtaP_10 LtaP_11 LtaP_12 LtaP_13 LtaP_14 LtaP_15 LtaP_16 LtaP_17 LtaP_18 LtaP_19 LtaP_20 LtaP_21 LtaP_22 LtaP_23 LtaP_24 LtaP_25 LtaP_26 LtaP_27 LtaP_28 LtaP_29 LtaP_30 LtaP_31 LtaP_32 LtaP_33 LtaP_34 LtaP_35 LtaP_36 MaxiA".
This leads to the following error message when Step 4 hits the KR normalization:
Traceback (most recent call last):
  File "/home/ec2-user/mHiC/bin/KR_norm_mHiC.py", line 466, in <module>
    writeInteraction(norm_mtx, baseName, args.outdir, revFragsDic, args.chrNum, args.resolution)
  File "/home/ec2-user/mHiC/bin/KR_norm_mHiC.py", line 357, in writeInteraction
    chr1, mid1 = revFragsDic[row[i]].split("_")
ValueError: too many values to unpack
This looks to be caused by the script splitting on every underscore, so a chromosome name such as LtaP_01 yields more than the two expected fields. Other than not using underscores in chromosome names, any ideas for a fix? I'll keep looking and will follow up if I come up with something.
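One possible workaround, sketched below, is to split on the last underscore only. This assumes the values stored in revFragsDic have the form "<chromosome>_<midpoint>" and that the midpoint itself never contains an underscore; that is inferred from the traceback line, not checked against the rest of KR_norm_mHiC.py. The helper name split_frag_key is hypothetical and only for illustration.

```python
def split_frag_key(key):
    """Split a 'chromosome_midpoint' string on its LAST underscore.

    Assumption: the midpoint (text after the final underscore) contains
    no underscore, so everything before it is the chromosome name.
    """
    # rsplit with maxsplit=1 splits once, starting from the right,
    # so underscores inside the chromosome name are left untouched.
    chrom, mid = key.rsplit("_", 1)
    return chrom, mid


# Chromosome name with an underscore, as in the chrList above.
print(split_frag_key("LtaP_01_12500"))
# Chromosome name without an underscore still works.
print(split_frag_key("MaxiA_2500"))
```

If that assumption holds, the equivalent change at line 357 would be to replace .split("_") with .rsplit("_", 1); any other place in the pipeline that parses these keys the same way would presumably need the same treatment.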