BayesTyper icon indicating copy to clipboard operation
BayesTyper copied to clipboard

[feature request] Genotyping more than 500 samples

Open Sherry520 opened this issue 3 years ago • 3 comments

I need to make a genotyping more than 500 samples, I hope the software can add this feature

Sherry520 avatar Dec 19 '20 08:12 Sherry520

Hi, due to how the genotyping algorithm is designed it will unfortunately not scale well too that many samples. What you could do instead is to combine all predicted variants across all 500 samples and then run BayesTyper on each sample independently or in batches using this combined variants set. See here for more information: https://github.com/bioinformatics-centre/BayesTyper/wiki/Executing-BayesTyper-on-sample-batches

Please let me know if you have any other questions.

jonassibbesen avatar Jan 15 '21 23:01 jonassibbesen

@jonassibbesen I followed the method “Executing BayesTyper on sample batches” to genotyping my samples, when I Combine the the batch vcf files using bcftools merge, Error occured: Failed to open bayestyper_rmdup_DH_00_unit_1/bayestyper-sk-b73.vcf.gz: not compressed with bgzip it seems like bayestypre use gzip to commpress the vcf files, But bcftools require bgzip commpressed vcf files

Sherry520 avatar Jan 27 '21 03:01 Sherry520

Thank you for mentioning this. I have updated the wiki with an additional bgzip compression step.

jonassibbesen avatar Jan 28 '21 20:01 jonassibbesen