arrowkdb
arrowkdb copied to clipboard
Any way to support dictionary encoded columns with ARROW_CHUNK_ROWS?
At the moment this throws an unequal length arrays error when attempting to pass the dictionary and indices in for a column represented that way.
The functionality is definitely supported by the AtrowStream format. It seems like the issue is that the MakeDictionary function and the MakeChunkedArray function don’t play nicely together. I’m not sure what the preferred solution is. I’m happy to handle preparing the value array manually and passing the indices in with an explicit reference if that’s what’s needed.
If you want to handle it in the library, my guess is you could handle the values and indices in separate passes?