Make a public bucket or NFS filer available w/ sample data
It would be nice for potential users to be able to run the pipeline against some sample input and output data to confirm they're running a pipeline correctly.
Also 3 WGS BAMs here, 1 normal and 2 tumors, for which it would be great to get mutect/strelka calls:
https://console.cloud.google.com/storage/browser/variant-calling-benchmarks-data/aocs/?project=pici-1286
And Arun's DREAM data:
https://console.cloud.google.com/storage/browser/dream-challenge/?project=pici-1286
On Mon, Aug 8, 2016 at 9:45 PM, Jeff Hammerbacher [email protected] wrote:
@smondet https://github.com/smondet notes we can use the DREAM chr20 BAMs by adding :seb-test-nfs-server-vm,/seb-test-storage,Hello.md,/ nfstest1 to CLUSTER_NFS_MOUNTS
— You are receiving this because you are subscribed to this thread. Reply to this email directly, view it on GitHub https://github.com/hammerlab/epidisco/issues/5#issuecomment-238430303, or mute the thread https://github.com/notifications/unsubscribe-auth/AAcjuMHkk0ng9YZe9oIsgMY9MTarCmAhks5qd9uygaJpZM4Jfovl .
Both of those are private buckets. The DREAM data is available in a public bucket at gs://public-dream-data/
The sample data should include reference + index and dbSNP/ExAC/COSMIC
Looks like hg38 is available w/ index: https://console.cloud.google.com/storage/browser/genomics-public-data/resources/broad/hg38/v0/