aws-sdk-pandas icon indicating copy to clipboard operation
aws-sdk-pandas copied to clipboard

store_parquet_metadata() with an entire NULL object column

Open thomasmeyler opened this issue 3 years ago • 2 comments

Much like like issue https://github.com/awslabs/aws-data-wrangler/issues/524#issue-790819816 I was wondering whether there was a way to override this error. I have a column that is nearly (99.9% of the time) and it would be great to be able to 'set/override the dtype to a string.

res = wr.s3.store_parquet_metadata( path=path', database="database", table="table", dataset=True, mode="overwrite" )

We can not infer the data type from an entire null object column

thomasmeyler avatar Jun 07 '22 17:06 thomasmeyler

To address this ignore_null was added to read methods in #1125. Would similar thing work for you?

kukushking avatar Jun 10 '22 09:06 kukushking

To address this ignore_null was added to read methods in #1125. Would similar thing work for you?

Yes, i believe that this would work well

thomasmeyler avatar Jun 20 '22 12:06 thomasmeyler

Marking this issue as stale due to inactivity. This helps our maintainers find and focus on the active issues. If this issue receives no comments in the next 7 days it will automatically be closed.

github-actions[bot] avatar Aug 19 '22 15:08 github-actions[bot]

Hi, I'm running into this error and would find an ignore_null flag on store_parquet_metadata very useful. I'm happy to open a PR to make the changes (also my bad if commenting on a closed issue is less than ideal, figured it's better than opening a new one).

raaidarshad avatar Aug 31 '23 19:08 raaidarshad