
Explanation of .gl, .gp, and .phased files

Open • rafaella-buzatu opened this issue 4 months ago • 1 comment

Hello! While running Monopogen, I noticed that it outputs several different files. Your tutorial says the final output is the .phased.vcf.gz file; however, that file only provides the genotype. I would also like to obtain the read depth and allele frequency for those variants. From what I can see, the .gl.vcf.gz file contains the depth, while the .gp.vcf.gz file contains the genotype and allele frequency. I have also noticed that the .gl.vcf.gz file contains unfiltered variants, while the .gp.vcf.gz file seems to contain only filtered variants, the same ones as in .phased.vcf.gz.

Could you help me understand what all these files are and how I could go about extracting as much information as possible for all variants (even unfiltered) from them?
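
As a rough illustration of the kind of extraction meant here, the sketch below reads any number of VCF files with the Python standard library and groups whatever INFO and FORMAT fields each file carries under a shared variant key (CHROM, POS, REF, ALT). The function names `read_vcf` and `merge_vcfs`, the labels, and the file names in the usage note are hypothetical, and nothing is assumed about which tags Monopogen actually writes; the code simply keeps every tag it finds, and only the first sample column is read.

```python
import gzip
from collections import defaultdict


def read_vcf(path):
    """Yield (key, info, sample) for each record in a plain or gzipped VCF.

    key is (CHROM, POS, REF, ALT); info is a dict built from the INFO column;
    sample maps FORMAT keys to the values of the first sample column.
    """
    opener = gzip.open if path.endswith(".gz") else open
    with opener(path, "rt") as handle:
        for line in handle:
            if line.startswith("#"):
                continue  # skip meta-information and header lines
            cols = line.rstrip("\n").split("\t")
            chrom, pos, _id, ref, alt, _qual, _filter, info_col = cols[:8]
            info = {}
            if info_col != ".":
                for item in info_col.split(";"):
                    name, sep, value = item.partition("=")
                    info[name] = value if sep else True  # flags carry no value
            sample = {}
            if len(cols) > 9:
                sample = dict(zip(cols[8].split(":"), cols[9].split(":")))
            yield (chrom, int(pos), ref, alt), info, sample


def merge_vcfs(paths):
    """Combine records from several VCFs, keyed by variant, then by file label.

    paths maps a caller-chosen label (hypothetical, e.g. "gl") to a file path.
    Variants missing from a file simply have no entry under that label.
    """
    merged = defaultdict(dict)
    for label, path in paths.items():
        for key, info, sample in read_vcf(path):
            merged[key][label] = {"INFO": info, "FORMAT": sample}
    return dict(merged)
```

Called as, say, `merge_vcfs({"gl": "chr20.gl.vcf.gz", "gp": "chr20.gp.vcf.gz", "phased": "chr20.phased.vcf.gz"})` (placeholder file names), this would keep unfiltered variants that appear only in the .gl file alongside the filtered ones, so the fields from each file can be compared per variant. Whether the records in the three files can be matched this way is an assumption that would need checking against the real output.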

Thank you!

rafaella-buzatu • Feb 13 '24 12:02