arrowkdb icon indicating copy to clipboard operation
arrowkdb copied to clipboard

Any way to support dictionary encoded columns with ARROW_CHUNK_ROWS?

Open nugend opened this issue 2 years ago • 0 comments

At the moment this throws an unequal length arrays error when attempting to pass the dictionary and indices in for a column represented that way.

The functionality is definitely supported by the AtrowStream format. It seems like the issue is that the MakeDictionary function and the MakeChunkedArray function don’t play nicely together. I’m not sure what the preferred solution is. I’m happy to handle preparing the value array manually and passing the indices in with an explicit reference if that’s what’s needed.

If you want to handle it in the library, my guess is you could handle the values and indices in separate passes?

nugend avatar Nov 14 '23 17:11 nugend