dataflow-java

Update ShardedBAMWriting to be compatible with parallel BAM reading.

Open deflaux opened this issue 8 years ago • 0 comments

WriteBAMTransform, which ShardedBAMWriting uses, assumes that it receives reads in order.

Add a group-by and a sort operation ahead of the write, so that input reads can be read from multiple BAM shards in parallel and still reach the writer in order.
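To make the request concrete, here is a minimal sketch of the group-then-sort step in plain Java collections rather than the Dataflow SDK. The `Read` class, its fields, and the choice of reference name as the grouping key are hypothetical stand-ins, not the project's actual types or sharding scheme; in the pipeline the same idea would presumably be a group-by-key followed by a per-key sort.

```java
import java.util.ArrayList;
import java.util.Comparator;
import java.util.List;
import java.util.Map;
import java.util.TreeMap;

public class GroupAndSortSketch {
    /** Hypothetical stand-in for an aligned read; not the project's Read type. */
    static final class Read {
        final String reference; // reference sequence name, used here as the group key
        final int start;        // alignment start position
        final String name;      // read name

        Read(String reference, int start, String name) {
            this.reference = reference;
            this.start = start;
            this.name = name;
        }
    }

    /** Groups reads by reference name, then sorts each group by start position. */
    static Map<String, List<Read>> groupAndSort(List<Read> reads) {
        // TreeMap orders keys lexicographically; a real writer would more likely
        // follow the sequence order of the BAM header (assumption).
        Map<String, List<Read>> byReference = new TreeMap<>();
        for (Read r : reads) {
            byReference.computeIfAbsent(r.reference, k -> new ArrayList<>()).add(r);
        }
        // Restore positional order inside each group, regardless of arrival order.
        for (List<Read> group : byReference.values()) {
            group.sort(Comparator.comparingInt((Read r) -> r.start));
        }
        return byReference;
    }

    public static void main(String[] args) {
        // Reads arriving interleaved, as they might from shards read in parallel.
        List<Read> reads = new ArrayList<>();
        reads.add(new Read("chr2", 500, "c"));
        reads.add(new Read("chr1", 300, "b"));
        reads.add(new Read("chr2", 100, "d"));
        reads.add(new Read("chr1", 100, "a"));

        for (Map.Entry<String, List<Read>> e : groupAndSort(reads).entrySet()) {
            for (Read r : e.getValue()) {
                System.out.println(e.getKey() + "\t" + r.start + "\t" + r.name);
            }
        }
    }
}
```

With this step in front of the writer, the order in which shards deliver reads stops mattering, at the cost of holding each group in memory while it is sorted.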

deflaux, Sep 15 '17 17:09