Patrick Hoefler
Patrick Hoefler
I think it might make sense increasing the minimum version, even if we don't drop everything 1.x related. The changes in pandas 2.0 caused quite some logic in Dask that...
Hi, I created a fork and fixed the build issues (this is because of a newer mistune version). Ideally, I'd like to merge this here again, but I am using...
This isn't stable unfortunately, we can't rely on the index of our original df being sorted: ``` if __name__ == "__main__": client = Client() pdf = pd.DataFrame( {"x": [1, 2,...
Thx for fixing the bot!
> Though I'm not clear on why this matters for algorithms against a string type? I don't think we are talking about the same thing. Even if we agree that...
@mlkui Regarding concat: This should already be zero copy: ``` df = pd.DataFrame({"a": [1, 2, 3]}, dtype="int64[pyarrow]") df2 = pd.DataFrame({"a": [1, 2, 3]}, dtype="int64[pyarrow]") x = pd.concat([df, df2]) ``` This...
@flying-sheep > If this happens, would dtype='string' and dtype='string[pyarrow]' be merged into one implementation? We’re currently thinking about coercing strings in our library, but hesitating because of the unclear future...
That's exactly what we intend to do. The csv default engine will stay the same for the time being
This actually upcasts to object, which means that the warning is kind of correct, but this should definitely work and continue to work. I suspect that we do a align...
We intend to change this to return false (discussed in Basel), should probably get this into 3.0