[Python] Support Binary/StringView in PyArrow
The new Binary and String View format types have been added to C++ (https://github.com/apache/arrow/pull/37792, basic implementation), but not yet exposed to Python.
This is an overview issue for adding support for those to pyarrow:
- [x] #39651
- [x] #39852
- [x] #40092
- [ ] Enable `from_buffers`
@jorisvandenbossche There seems to be a long tail of compute functions not currently supported, namely:
- casting between String and StringView
- comparison operations
- Scalar generation
Are there separate issues tracking those or is this the one? If this is the one, I'm curious about the priority of addressing them.
Yes, indeed, in general the string view type is not yet widely supported
> casting between String and StringView
That should be working now with the latest 18.0 release
> Are there separate issues tracking those or is this the one?
Most of those issues will have to be fixed / implemented on the C++ side. One such issue about adding more functionality is https://github.com/apache/arrow/issues/39634
> Scalar generation
What exactly do you mean by this item?
This issue has been marked as stale because it has had no activity in the past 365 days. Please remove the stale label or comment below, or this issue will be closed in 14 days. If this improvement is still desired but has no current owner, please add the 'Status: needs champion' label.