mHiC icon indicating copy to clipboard operation
mHiC copied to clipboard

Step 4 breaks with underscore in chromosome names.

Open aakashsur opened this issue 5 years ago • 8 comments

So I have underscores in my chromosome names and the chrList variable is set to "LtaP_01 LtaP_02 LtaP_03 LtaP_04 LtaP_05 LtaP_06 LtaP_07 LtaP_08 LtaP_09 LtaP_10 LtaP_11 LtaP_12 LtaP_13 LtaP_14 LtaP_15 LtaP_16 LtaP_17 LtaP_18 LtaP_19 LtaP_20 LtaP_21 LtaP_22 LtaP_23 LtaP_24 LtaP_25 LtaP_26 LtaP_27 LtaP_28 LtaP_29 LtaP_30 LtaP_31 LtaP_32 LtaP_33 LtaP_34 LtaP_35 LtaP_36 MaxiA"

This leads to the following error message when Step 4 hits the KR normalization:

Traceback (most recent call last):
  File "/home/ec2-user/mHiC/bin/KR_norm_mHiC.py", line 466, in <module>
    writeInteraction(norm_mtx, baseName, args.outdir, revFragsDic, args.chrNum, args.resolution)
  File "/home/ec2-user/mHiC/bin/KR_norm_mHiC.py", line 357, in writeInteraction
    chr1, mid1 = revFragsDic[row[i]].split("_")
ValueError: too many values to unpack

Which looks to be caused because it is parsing chromosome names by underscore. Other than not using underscores in chromosome names, any ideas for a fix? I'll continue to look and see if I come up with something.

aakashsur avatar Mar 06 '19 20:03 aakashsur