dask issues

Index Query Hangs

4

**Describe the issue**: After setting the index to a timestamp, some `loc`-based queries work, but string-based querying causes the operation to hang unless we call optimize first. **Minimal Complete Verifiable...

mscanlon-exos

dataframe

needs attention

bug

Infer schema does not work properly for categorical types with high cardinality values

Using a categorical type with high-cardinality values that do not fit into `int8` leads to an error: ``` ValueError: Failed to convert partition to expected pyarrow schema: `ArrowInvalid('Integer value 575 not...

dbalabka

needs attention

needs triage
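
The `int8` limit mentioned in the report above can be illustrated on the pandas side. This is only a sketch and uses pandas alone, not dask or pyarrow. It shows that pandas picks the width of a categorical's integer codes from the number of categories. Whether the failing schema was actually inferred from a low-cardinality sample is an assumption and is not confirmed by the truncated report.

```python
import pandas as pd

# Few categories: pandas stores the integer codes as int8.
small = pd.Categorical([f"v{i}" for i in range(10)])

# 600 distinct categories cannot be indexed by int8 (max 127),
# so pandas widens the codes to int16.
large = pd.Categorical([f"v{i}" for i in range(600)])

small_dtype = str(small.codes.dtype)
large_dtype = str(large.codes.dtype)
print(small_dtype, large_dtype)
```

If a schema is derived from data like `small` and then applied to data like `large`, a code such as 575 would not fit the narrower index type.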

Track unknown shapes

1

I would like to suggest a new feature that would make it possible to run `a[mask] = b[mask]`, where a, b, and mask are all Dask arrays. This is currently...

crusaderky

array

discussion

p2

feature
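
The `a[mask] = b[mask]` request above depends on shapes that are unknown until compute time. The sketch below shows that boolean indexing of a Dask array gives a length of `nan`, and that `da.where` gives the same elementwise result while keeping shapes known. The array contents are made up for illustration, and `da.where` is an alternative technique, not the feature proposed in the issue.

```python
import numpy as np
import dask.array as da

a = da.arange(10, chunks=5)
b = a * 10
mask = a % 2 == 0

# Boolean indexing yields an array whose length is unknown until computed.
selected = b[mask]
unknown_length = bool(np.isnan(selected.shape[0]))

# Elementwise alternative: take b where mask is True, otherwise keep a.
# The shape of the result stays known because nothing is filtered out.
result = da.where(mask, b, a).compute()
print(unknown_length, result)
```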

FutureCancelledError when applying map_partitions on un-aligned dataframes

1

**Describe the issue**: The new implementation of `map_partitions` does not have the same behavior as the original one when dealing with multiple un-aligned dataframes. I understand that this is not...

mlemainque

needs triage

Link fixed for documentation

5

Fixes issue #11838 so that users land on the exact documentation page for Dask.

O-sama12

needs attention

Investigate using da.pad in da.overlap

27

The `da.overlap` module was written to handle `map_overlap` and all of its intricacies. It provides some support for adding padding to edge chunks. Though it only supports a few modes...

jakirkham

array
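
For context on the `da.pad` proposal above, here is a minimal sketch of `da.pad` applied to a chunked array and compared with `np.pad`. The input values and the choice of `mode="edge"` are illustrative assumptions. The example does not show how `da.overlap` handles edge chunks internally.

```python
import numpy as np
import dask.array as da

x_np = np.arange(6)
x = da.from_array(x_np, chunks=3)

# Pad two elements on each side by repeating the edge values.
padded = da.pad(x, 2, mode="edge").compute()

# The NumPy equivalent on the unchunked data, for comparison.
expected = np.pad(x_np, 2, mode="edge")
print(padded, expected)
```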

Enable column projection in ``MapPartitions``

5

This feature was requested at the last [Dask Community Meeting](https://docs.google.com/document/d/1UqNAP87a56ERH_xkQsS5Q_0PKYybd5Lj2WANy_hRzI0/edit?tab=t.0). Adds a `required_columns` argument to `map_partitions`. If this argument is specified, column projections are no longer blocked by `MapPartitions` expressions.

rjzamora