eggo
eggo copied to clipboard
1000 Genomes Phase 3 VCF data set to be hosted in S3 in parquet format
OOC, is the ALL.wgs.phase3_shapeit2_mvncall_integrated_v5b.20130502.sites.vcf.gz file here an example of the VCF you're referring to?
I believe so, but I'd like to take the genotypes as well, not just the variants.
You already have these at s3://bdg-eggo/1kg/genotypes/, no?
I think those were phase 1. And outdated anyway...wanna redo it.