mongo-arrow icon indicating copy to clipboard operation
mongo-arrow copied to clipboard

Can `find_all_pandas` treat list of struct as nested dataframe?

Open anentropic opened this issue 2 years ago • 1 comments
trafficstars

I have a mongo document which has a list field containing child documents.

Pandas data frames can be nested. And PyArrow has Table and RecordBatch types.

I would like to avoid having to call pandas.json_normalize on the child list and instead have find_all_pandas return directly a nested dataframe.

Would it be possible to use Table or RecordBatch type in the schema to get this behaviour?

anentropic avatar Oct 12 '23 09:10 anentropic

Hi @anentropic, I've created a ticket to track this request: https://jira.mongodb.org/browse/ARROW-185

blink1073 avatar Oct 16 '23 16:10 blink1073