dataflow-java
dataflow-java copied to clipboard
Google Cloud Dataflow pipelines such as Identity-By-State as well as useful utility classes.
Bumps [com.github.samtools:htsjdk](https://github.com/samtools/htsjdk) from 1.128 to 3.0.1. Release notes Sourced from com.github.samtools:htsjdk's releases. 3.0.1 Fix for a long standing vulnerability around temporary directory creation which could expose data to malicious users...
Bumps [junit:junit](https://github.com/junit-team/junit4) from 4.11 to 4.13.1. Release notes Sourced from junit:junit's releases. JUnit 4.13.1 Please refer to the release notes for details. JUnit 4.13 Please refer to the release notes...
_This issue was automatically created by [Allstar](https://github.com/ossf/allstar/)._ **Security Policy Violation** Project is out of compliance with Binary Artifacts policy: binaries present in source code **Rule Description** Binary Artifacts are an...
There haven't been any updates on this repo in 2 years and the documentation is out-of-date #223 Seems like this repo should be deprecated so as not to confuse customers.
https://cloud.google.com/genomics/install-genomics-tools doesn't seem to exist.
In LoadReadsToBigQuery, we currently ignore info filed map in Reads proto. We should export the data in this field to BigQuery too.
[WriteBAMTransform](https://github.com/googlegenomics/dataflow-java/blob/master/src/main/java/com/google/cloud/genomics/dataflow/writers/bam/WriteBAMTransform.java) used by [ShardedBAMWriting](https://github.com/googlegenomics/dataflow-java/blob/master/src/main/java/com/google/cloud/genomics/dataflow/pipelines/ShardedBAMWriting.java) assumes that it receives reads in order. Add a group by and a sort operation so that input reads can be read from multiple BAM shards...
I am trying to apply the IdentityByState pipeline to my variant data, but it reliably (n=4) fails with a write error. Error message: ``` (dd6b6f2b6ea510df): Workflow failed. Causes: (dd6b6f2b6ea5110a): S07:VariantStreamer/ParDo(RetrieveVariants)+VariantStreamer/ParDo(ConvergeVariantsList)+JoinNonVariantSegmentsWithVariants.BinShuffleAndCombineTransform/ParDo(BinVariants)+JoinNonVariantSegmentsWithVariants.BinShuffleAndCombineTransform/GroupByKey/Reify+JoinNonVariantSegmentsWithVariants.BinShuffleAndCombineTransform/GroupByKey/Write...