dataflow-java
dataflow-java copied to clipboard
Google Cloud Dataflow pipelines such as Identity-By-State as well as useful utility classes.
Once https://github.com/googlegenomics/dataflow-java/pull/140 is in, refactor the ShardedBAMWriting.java into several classes for easier reuse in other pipelines. Should be pretty easy since the code is already separated into inner classes. Also,...
ReadConverter.makeRead takes a SamRecord and produces a model read. I think the reverse process should also live in this file (or at least near this file in this repo). I'll...
This repository is bloated due to checking in a few bundled jars, fixed via https://github.com/googlegenomics/dataflow-java/issues/49 Let's scrub out those obsolete jars via https://help.github.com/articles/remove-sensitive-data/ to reduce the bloat.