Rob Currie
Results
2
comments of
Rob Currie
The Treehouse compendium is ~11k samples by 30k genes/features. Storing on disk in an hdf5 file take about 1GB and loads into a dataframe (R or Python) in < 600msec....
On a related vertical note, for the GA4GH [Cancer Gene Trust](https://genomicsandhealth.org/cancer-gene-trust-white-paper-read-online), an IPFS based distributed genomic store, I've implemented an elastic search based [crawler](https://github.com/rcurrie/search-cgt) with [web UI](https://github.com/rcurrie/search-cgt) showing the network...