
BAM files produced by the default hisat2 pipeline cannot be compressed with gtz

Open boiscat opened this issue 4 years ago • 1 comments

Reference file: genome.fa (hg19), downloaded from: https://hgdownload.soe.ucsc.edu/goldenPath/hg19/bigZips/chromFa.tar.gz

Reference used for the hisat2 run: hg19 genome_tran, downloaded from: https://ccb.jhu.edu/software/hisat2/manual.shtml

I generated the BAM file with the default hisat2 pipeline (without hisat2-gtz). Compressing it with gtz fails with the following error:


prepare compression... 100%, cost 23s (18|5)
Enabling high-rate compression mode with /home/.config/gtz/genome.fa-D2A70550489DE356A2CD6BFC40711204.bam.rbin2( hardware speedup )
[                                                  ] 0%

RNAME "1" was not found in reference file: /home/Erythropoiesis_APA/HSC_APA/hsc_apa1/04-salmon/00-file/genome.fa
error: the reference file was detected to not match this BAM file!

boiscat Jun 20 '20 02:06

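@boiscat, hello. This error is raised because gtz found a record in the BAM whose third field (RNAME) is "1", and that RNAME was not found in the reference. There are two possible causes:

1. The reference file passed via --ref when compressing the BAM with gtz is not the reference that hisat2 used to generate the BAM (here it should be the one you mentioned as "Reference used for the hisat2 run: hg19 genome_tran").

2. The BAM record that triggered the error is malformed, for example missing a field? "1" does not look like a normal RNAME.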
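One way to tell these two cases apart is to compare the reference names declared in the BAM header with the sequence names in the FASTA passed to --ref. The sketch below is only an illustration, not part of gtz: it assumes the header has been dumped to text first (for example with `samtools view -H`), and the helper names `fasta_names`, `sam_header_names`, and `missing_rnames` are hypothetical. Note that the UCSC hg19 FASTA names its sequences in the `chr1` style, whereas some other references use bare names such as `1`, so a naming mismatch between the two files would produce exactly this kind of error.

```python
from typing import Iterable, List, Set


def fasta_names(lines: Iterable[str]) -> Set[str]:
    """Collect sequence names from FASTA header lines.

    The name is the text after '>' up to the first whitespace.
    """
    names = set()
    for line in lines:
        if line.startswith(">"):
            fields = line[1:].split()
            if fields:
                names.add(fields[0])
    return names


def sam_header_names(lines: Iterable[str]) -> Set[str]:
    """Collect reference names from the SN: tag of @SQ header lines."""
    names = set()
    for line in lines:
        if not line.startswith("@SQ"):
            continue
        for field in line.rstrip("\n").split("\t")[1:]:
            if field.startswith("SN:"):
                names.add(field[3:])
    return names


def missing_rnames(header_lines: Iterable[str], fasta_lines: Iterable[str]) -> List[str]:
    """Return header reference names that do not occur in the FASTA."""
    return sorted(sam_header_names(header_lines) - fasta_names(fasta_lines))


if __name__ == "__main__":
    # Tiny inline example; real input would be read from files.
    header = ["@SQ\tSN:1\tLN:249250621\n", "@SQ\tSN:2\tLN:243199373\n"]
    fasta = [">chr1\n", "ACGT\n", ">chr2\n", "ACGT\n"]
    print(missing_rnames(header, fasta))
```

With real data, the two inputs could be opened as text files, e.g. the output of `samtools view -H` on the BAM and the genome.fa given to --ref. If every header name is reported as missing, the wrong reference (or a differently named one) was most likely supplied; if the header matches but gtz still fails, the individual record is the more likely culprit.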

xuxiali13 Jun 22 '20 01:06