eggo icon indicating copy to clipboard operation
eggo copied to clipboard

1000 Genomes Phase 3 VCF data set to be hosted in S3 in parquet format

Open laserson opened this issue 10 years ago • 4 comments

laserson avatar Sep 15 '15 23:09 laserson

OOC, is the ALL.wgs.phase3_shapeit2_mvncall_integrated_v5b.20130502.sites.vcf.gz file here an example of the VCF you're referring to?

ryan-williams avatar Sep 16 '15 22:09 ryan-williams

I believe so, but I'd like to take the genotypes as well, not just the variants.

laserson avatar Sep 17 '15 07:09 laserson

You already have these at s3://bdg-eggo/1kg/genotypes/, no?

fnothaft avatar Sep 17 '15 14:09 fnothaft

I think those were phase 1. And outdated anyway...wanna redo it.

laserson avatar Sep 18 '15 02:09 laserson