[Python] Support Binary/StringView in PyArrow

Open jorisvandenbossche opened this issue 1 year ago • 1 comments

The new Binary and String View format types have been added to C++ (https://github.com/apache/arrow/pull/37792, basic implementation), but not yet exposed to Python.

This is an overview issue for adding support for those types to pyarrow:

  • [x] #39651
  • [x] #39852
  • [x] #40092
  • [ ] Enable from_buffers

jorisvandenbossche avatar Jan 16 '24 13:01 jorisvandenbossche

@jorisvandenbossche There seems to be a long tail of compute functions that are not currently supported, namely:

  • casting between String and StringView
  • comparison operations
  • Scalar generation

Are there separate issues tracking those, or is this the one? If this is the one, I'm curious about the priority of addressing them.

a10y avatar Oct 22 '24 18:10 a10y

Yes, indeed, in general the string view type is not yet widely supported.

> casting between String and StringView

That should be working now with the latest 18.0 release.

> Are there separate issues tracking those or is this the one?

Most of those issues will have to be fixed / implemented on the C++ side. One such issue about adding more functionality is https://github.com/apache/arrow/issues/39634

> Scalar generation

What exactly do you mean by this item?

jorisvandenbossche avatar Nov 05 '24 16:11 jorisvandenbossche

This issue has been marked as stale because it has had no activity in the past 365 days. Please remove the stale label or comment below, or this issue will be closed in 14 days. If this improvement is still desired but has no current owner, please add the 'Status: needs champion' label.

github-actions[bot] avatar Nov 18 '25 11:11 github-actions[bot]