hail
hail copied to clipboard
Cloud-native genomic dataframes and batch computing
See https://github.com/broadinstitute/gnomad-browser/issues/914. In [the line in question](https://github.com/broadinstitute/gnomad-browser/blob/b497106d97773affd81b48eadfa5586259e011e5/data-pipeline/src/data_pipeline/data_types/gtex_tissue_expression.py#L14), we attempt to export a `Table` with ~13,000 columns, and get the following error: `is.hail.relocated.org.objectweb.asm.MethodTooLargeException: Method too large: __C19580collect_distributed_array.__m19633split_InsertFields ()V` (see above-referenced issue...
These should never have been read-write. Caught this because a CI job I was modifying overwrote `/gsa-key/key.json` with `/test-gsa-key/key.json` which caused the Output step to use the test credentials instead...
This isn't working yet, but I think it's pretty close. It transforms a BlockMatrix into a row major table of entries. Things to note: 1. I'm not respecting `partitionRanges` right...
~Stacked on #11045~ Implements a method to orthogonalize each row of (an entry field of) a matrixtable against all previous rows within a specified window size. The method has time...
Some delimited text processors use quotes to escape quotes so that the string `a"b` is rendered as `a""b`. Moreover an individual entry of the delimited text is itself wrapped in...
See the transcript below. This is particularly confusing for users because python often elides the non-printable characters. A small wrinkle of confusion is that the UTF-8 BOM, `ef bb bf`,...
`hailctl dataproc` can pass through gcloud configurations specified with `--configuration` to the `gcloud dataproc` commands that it runs. However, that argument is often not respected for other `gcloud` commands that...
Hail version: 0.2.55-0d4ce0df2457 The behavior of the second parameter to [StringExpression's split method](https://hail.is/docs/0.2/hail.expr.StringExpression.html#hail.expr.StringExpression.split) is determined by its Java implementation and differs from Python [string's split method](https://docs.python.org/3/library/stdtypes.html#str.split). > The limit parameter...
To replicate, replace the contents of `test_king.py::test_king_small` with: ``` @fails_local_backend() def test_king_small(): hl.init(idempotent=True) # Should be no error hl.stop() hl.init(idempotent=True) # Should be no error hl.init(hl.spark_context(), idempotent=True) # Should be...