Guido Imperiale

Results 173 comments of Guido Imperiale

Something's wrong, beyond the .bs files. As a temporary solution before putting my hands in the buttersink code, I wrote this: ``` #!/usr/bin/env python """"Tag all buttersink snapshot files with...

Adding the -d flag gives me some new insight of what's going on: ``` buttersink -d -e -n /btrfs/ezgi/ s3://crusaderky-buttersink/ezgi/ Listing S3 Bucket "crusaderky-buttersink" contents... Optimal synchronization: 653.6 GiB from...

This has already been merged as part of #10939

@jrbourbeau @phofl how should we deal with this? It's impossible to replicate pandas' current (and deprecated) behaviour, as it would require us to eagerly compute everything until that moment for...

A third option is to print a RuntimeWarning at graph definition time every time lhs.dtype != rhs.dtype, and then implement pandas 3's behaviour of sticking with lhs.dtype.

> we can just filter the warning and be done with it? that would mean that out.compute().dtype != out.dtype, and that the output dtype will change overnight when the user...

https://github.com/dask/dask/pull/10932 fixes the flaky test

@fjetter offline you expressed concern for dask-expr optimization performance. I'm observing a 50\~150ms slowdown for the full TPCH queries. IMHO it's negligible. runtime for graph definition + optimization. Note that...

All PRs are now only waiting for review