Dan King
Dan King
I assigned you Jackie since you've been looking at this. Feel free to unassign if you stop looking into it!
A fix to this issue would detect which FASTAs and which chain files (for liftovers) are needed by a pipeline (a call to `ServiceBackend.execute`) and only mount the necessary ones.
Ideally we want an arbitrary aggregator over any index. That's a bit of work though.
Each split is 3GB uncompressed and the mean on-disk split is 6MB? That's a wild compression ratio. Hmm. Maybe by default we should use concurrency = 1 in DataFusion? It...
Reported here: https://hail.zulipchat.com/#narrow/stream/123010-Hail-Query-0.2E2-support/topic/Element.20Wise.20Sums
Ideally this would also work with Polars, DataFusion & DuckDB.
Hi @aditanase ! Thanks for creating this issue! Issue #2657 tracks (a portion of) our string wishlist. In the single column case, what you've described above is implemented as the...
Hey @frederikja163 ! Welcome! You are correct, this function is not defined, but it should be. We are in the midst of a rather significant change to the Vortex library...