Read count in paired fastq bamhash computation

Open stolpeo opened this issue 6 years ago • 0 comments

When BamHash is run on paired FASTQ files, the reported read count differs from the count reported for the BAM file those FASTQ files were converted from (as already mentioned in https://github.com/DecodeGenetics/BamHash/issues/4#issuecomment-157213763). The hashes match, but the FASTQ run reports exactly half as many reads as the BAM run:

$ bamhash_checksum_bam sample.bam
c0039f91693d4bfd	1749217454

$ bamhash_checksum_fastq sample_R1.fq sample_R2.fq
c0039f91693d4bfd	874608727

I would expect the two read counts to be identical, and specifically to match the number reported for the BAM file. Is this behavior intentional? If not, it would be great if this could be fixed!
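
For reference, here is a quick arithmetic check on the two counts reported above. It only confirms that the BAM count is exactly twice the FASTQ count. That would be consistent with the FASTQ tool counting read pairs while the BAM tool counts individual records, but this is my assumption and I have not verified it against the BamHash source. The variable names are just for illustration.

```python
# Read counts copied from the two BamHash runs above.
bam_count = 1749217454    # from: bamhash_checksum_bam sample.bam
fastq_count = 874608727   # from: bamhash_checksum_fastq sample_R1.fq sample_R2.fq

# Assumption (not confirmed by the BamHash source): the FASTQ tool counts
# one per read pair, whereas the BAM tool counts one per record.
ratio_is_exactly_two = (bam_count == 2 * fastq_count)
print(ratio_is_exactly_two)
```

If the assumption holds, doubling the FASTQ count (or halving the BAM count) would make the two outputs directly comparable.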

$ bamhash_checksum_fastq --version
bamhash_checksum_fastq version 1.1

Thanks, Oliver

stolpeo avatar Jul 12 '17 09:07 stolpeo