Multi-GPU support with dask
This adds dask support
Functions to add:
- [x] `calculate_qc_metrics`
- [x] `normalize_total`
- [x] `log1p`
- [x] `highly_variable_genes` with `seurat` and `cell_ranger`
- [x] `scale`
- [x] PCA
- [ ] `neighbors`
There will be a separate PR for the docstring updates and a tutorial.
I renamed the QC functions and some of the variables so it's a bit clearer what's happening.
https://github.com/scverse/rapids_singlecell/pull/179/files#r1838498091 is not done and from what I can tell #179 (review) has not been addressed. What happens if you pass a csc dask array to pca?
That will just raise an error telling the user to provide a dense or CSR array as meta. I updated `_check_gpu_X` to reflect that.
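For illustration, a minimal sketch of the kind of check described above. This is not the actual `_check_gpu_X` implementation: the helper name `check_meta` is hypothetical, and it uses NumPy and `scipy.sparse` on the CPU as stand-ins for the CuPy types the GPU code would see. The idea is that the chunk type of a Dask array can be read from its `_meta` attribute and rejected early if it is neither dense nor CSR.

```python
import numpy as np
from scipy import sparse


def check_meta(meta):
    """Return "dense" or "csr" for a supported chunk type, else raise.

    Hypothetical helper: `meta` is the empty placeholder array that Dask
    keeps on `dask_array._meta` to describe the type of each chunk.
    CPU types are used here as stand-ins for their CuPy equivalents.
    """
    if isinstance(meta, np.ndarray):
        return "dense"
    if sparse.issparse(meta) and meta.format == "csr":
        return "csr"
    # Report the sparse format (e.g. "csc") or the class name otherwise.
    found = meta.format if sparse.issparse(meta) else type(meta).__name__
    raise TypeError(
        f"Dask chunks of type {found!r} are not supported; "
        "please pass a dense or CSR array as meta."
    )
```

With a real Dask array `X` this would be called as `check_meta(X._meta)`, so a CSC-backed array fails before any computation is scheduled.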
I'll test the median today.
We should look into the cost of allocating ahead of time for all operations that are currently in-place
Computing the median out of core is a bad choice: it uses way more memory and is slower. Lose-lose.