serde_arrow
serde_arrow copied to clipboard
is there a way to change the index type of dictionary encodings of strings?
uint32 specifically doesn't seem to work with pandas... (the arrow batch readers resulting cannot do read_pandas() for example).
I'm trying an overwrite with a custom dictionary key field DataType.....
Changing this index is currently not implemented. You could manually overwrite the corresponding fields. But I would also be happy to accept a patch that adds this option to the current code.
FYI. I chose uint32 as it is used by polars as its index type.
Yes I got it working with overwrites, I'll see about a patch when I am back at my desk!