cnvkit
cnvkit copied to clipboard
ValueError: Duplicated genomic coordinates in sample set
Hi, all, my question is as follows, how can I solve it? thanks
my command: cnvkit.py fix tumor.targetcoverage.cnn tumor.antitargetcoverage.cnn Reference.cnn -o tumor.cnr
Processing target: samplename
Traceback (most recent call last):
File "xxx/cnvkit.py", line 9, in
I got the same problem.
I had the same problem.
I realised that my target files had duplicated lines. I did: target.bed | sort | uniq > new-target.bed and it solved the problem.
In my case, this was because the target bed bore an individual record of the same feature for potential synonyms of that feature. See below:
$ grep 152719709 target.bed
chrX 152719709 152719829 TREX2
chrX 152719709 152719829 HAUS7
This issue was solved with awk '!seen[$1$2$3]++' target.bed > new-target.bad. Only the first record for each identical region is taken. I think it's possible that these are different genes that occupy the same region, which would make this an imperfect solution.