hail
hail copied to clipboard
Cloud-native genomic dataframes and batch computing
CHANGELOG: Fix many issues, including (hail#14274), with `hailctl batch submit` introduced in 0.2.127. Fixes https://github.com/hail-is/hail/issues/14274 Replacing PR #14186
This PR adds the job groups functionality as described in this [RFC](https://github.com/hail-is/hail-rfcs/pull/5) to the Batch backend and `hailtop.batch_client`. This includes supporting nested job groups up to a maximum depth of...
Fixes #13971 CHANGELOG: Hail now supports and primarily tests against Dataproc 2.2.5, Spark 3.5.0, and Java 11. We strongly recommend updating to Spark 3.5.0 and Java 11. You should also...
Stacked on https://github.com/hail-is/hail/pull/14316. These are tests for the bug fixed by https://github.com/hail-is/hail/issues/14130.
Stacked on https://github.com/hail-is/hail/pull/14313. It is currently too hard to see what parts of python jobs are well tested or not.
This was spawned by https://github.com/hail-is/hail/pull/14310 which slipped into 0.2.126 because we do not have types on some of our inner functions. In my attempt to type everything, I discovered a...
### What happened? I expected when I submitted a Batch job with `attributes={'foo': 1}` that it would return an error telling me the values must be strings. Instead, I got...
CHANGELOG: Introduce `hailctl fs sync` which robustly transfers one or more files between Amazon S3, Azure Blob Storage, and Google Cloud Storage. There are really two distinct conceptual changes remaining...
In particular, this allows `async with RouterAsyncFS(...) as fs` to give a tight type to `fs`.
Stacked on https://github.com/hail-is/hail/pull/14315