dask
Parallel computing with task scheduling
**Describe the issue**: After setting the index to a timestamp, some `loc`-based queries work, but string-based querying causes the operation to hang unless we call optimize first. **Minimal Complete Verifiable...
Using a categorical type with high-cardinality values that do not fit into `int8` leads to an error: ``` ValueError: Failed to convert partition to expected pyarrow schema: `ArrowInvalid('Integer value 575 not...
I would like to suggest a new feature that would make it possible to run `a[mask] = b[mask]`, where a, b, and mask are all Dask arrays. This is currently...
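For context, when `a`, `b`, and `mask` all have the same shape, the effect of `a[mask] = b[mask]` can be approximated today with `da.where`. The sketch below is a workaround, not the requested feature: it builds a new array instead of assigning in place, and the array names, shapes, and chunk sizes are made up for illustration.

```python
import numpy as np
import dask.array as da

# Illustrative inputs (hypothetical data, not from the issue).
a_np = np.arange(12).reshape(3, 4)
b_np = -np.arange(12).reshape(3, 4)
mask_np = a_np % 2 == 0

a = da.from_array(a_np, chunks=(2, 2))
b = da.from_array(b_np, chunks=(2, 2))
mask = da.from_array(mask_np, chunks=(2, 2))

# Take values from b where mask is True, otherwise keep a.
# This returns a new lazy array; a itself is left unchanged.
result = da.where(mask, b, a)

# Reference result using NumPy's in-place masked assignment.
expected = a_np.copy()
expected[mask_np] = b_np[mask_np]

matches = bool((result.compute() == expected).all())
```

Because `da.where` is elementwise, this only stands in for the masked assignment when the three arrays share a shape (or broadcast to one).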
**Describe the issue**: The new implementation of `map_partitions` does not have the same behavior as the original one when dealing with multiple un-aligned dataframes. I understand that this is not...
Fixes issue #11838 so that users land on the exact documentation page for Dask.
The `da.overlap` module was written to handle `map_overlap` and all of its intricacies. It provides some support for adding padding to edge chunks. Though it only supports a few modes...
This feature was requested at the last [Dask Community Meeting](https://docs.google.com/document/d/1UqNAP87a56ERH_xkQsS5Q_0PKYybd5Lj2WANy_hRzI0/edit?tab=t.0). Adds a `required_columns` argument to `map_partitions`. If this argument is specified, column projections are no longer blocked by `MapPartitions` expressions.
SciPy < 1.15 does not support 1D sparse slices.
Would be nice if somehow `cytoolz` was noted as an optional dependency in `setup.py`. Not totally sure the right place to add it though.
It should go here: https://docs.dask.org/en/stable/ since that is the documentation.