Joris Van den Bossche

190 issues by Joris Van den Bossche

Overview of work for the future string dtype ([PDEP-14](https://pandas.pydata.org/pdeps/0014-string-dtype.html)).

Main implementation:

- [x] Implement the object-dtype based fallback:
  - [x] https://github.com/pandas-dev/pandas/pull/58451
- [ ] Rename the storage options (from the...

API Design
Strings
Arrow

xref https://github.com/pandas-dev/pandas/issues/57431

I _think_ we are sure that the `values` from `self.get_new_values(..)`, which we pass here to the DataFrame constructor, are always new values owned by us (not shared by...

Performance
Copy / view semantics

Closes https://github.com/pandas-dev/pandas/issues/50756
xref https://github.com/pandas-dev/pandas/issues/57431

With CoW enabled now, the default behaviour of `DataFrame(ndarray)` is to copy the numpy array (before 3.0, this would not copy the data). However, if we...

Performance
Copy / view semantics
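
The copy-versus-view distinction described in this entry can be checked from user code. The following is a minimal sketch, assuming only the public `copy` keyword of the `DataFrame` constructor and `np.shares_memory`; the variable names are illustrative and nothing here is taken from the pull request itself. Passing `copy` explicitly avoids relying on the version-dependent default.

```python
import numpy as np
import pandas as pd

# A small 2D array of a single dtype, so the DataFrame can hold it as one block.
arr = np.arange(6, dtype="float64").reshape(3, 2)

# Explicitly request a copy: the DataFrame gets its own buffer.
df_copy = pd.DataFrame(arr, copy=True)

# Explicitly opt out of the copy: the DataFrame wraps the caller's memory.
df_view = pd.DataFrame(arr, copy=False)

# np.shares_memory reports whether two arrays overlap in memory.
copy_shares = np.shares_memory(arr, df_copy.to_numpy())
view_shares = np.shares_memory(arr, df_view.to_numpy())

print("copy=True shares memory with input:", copy_shares)
print("copy=False shares memory with input:", view_shares)
```

Spelling out `copy=` keeps the behaviour the same whether the installed pandas copies by default or not.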

Currently, the footer start/center/end sections are always placed next to each other. On mobile, when using more than 1 section (as the default does using start/end, or our own docs...

kind: enhancement
tag: design

### Rationale for this change

NumPy 2.0 changes parts of its ABI; see the issue description and https://github.com/numpy/numpy/pull/25946 for more details. The changes here should make our code compatible both with...

Component: Python
awaiting merge

From https://github.com/apache/arrow/pull/34289#pullrequestreview-1355094099

Currently, the `pyarrow.array(..)` constructor is meant to create an Array object, but it can return a ChunkedArray instead in two cases: 1) the object is too big to fit into...

Component: Python

### Rationale for this change

From debugging the failure, it seems this is due to pandas changing a filter operation to sometimes preserve a RangeIndex now instead of returning an...

Component: Python
awaiting committer review

See eg https://github.com/ursacomputing/crossbow/actions/runs/8104554803/job/22151387751

```
___________________________ test_filters_equivalency ___________________________

tempdir = PosixPath('/tmp/pytest-of-root/pytest-0/test_filters_equivalency0')

    @pytest.mark.pandas
    def test_filters_equivalency(tempdir):
        local = LocalFileSystem()
        base_path = tempdir

        integer_keys = [0, 1]
        string_keys = ['a', 'b', 'c']
        boolean_keys =...
```

Component: Python
Component: Continuous Integration

The current build system of pyarrow consists of using setuptools with a custom `setup.py` that calls cmake to do the actual build. While this works, there are nowadays newer python...

Component: Python

### Version

23.06.00a+39.g86e8912a

### On which installation method(s) does this occur?

Conda

### Describe the issue

When you have a GeoSeries of points with coordinates that have missing values (null,...

bug
Needs Triage