snowpark-python icon indicating copy to clipboard operation
snowpark-python copied to clipboard

SNOW-2409156: Add write_parquet function equivalent to write_arrow

Open nicornk opened this issue 2 months ago • 3 comments

What is the current behavior?

There is no native function in snowpark-python or the snowflake-python-connector to ingest a folder of parquet files in a performant way to Snowflake. The closest is Session.write_arrow which already has all required logic.

What is the desired behavior?

Add Session.write_parquet(folder_with_parquet_files, ...) that reuses the logic from write_arrow. Refactor write_arrow to first create the parquet files and than calls Session.write_parquet

How would this improve snowflake-snowpark-python?

Many customers already have parquet files written by other engines that need to be quickly ingested into Snowflake. Avoiding the arrow step adds the most performant ingestion method to snowpark-python.

References, Other Background

nicornk avatar Oct 13 '25 19:10 nicornk

I will prepare a Pull Request for this FR.

nicornk avatar Oct 13 '25 19:10 nicornk

Thank you for the PR @nicornk . Team will review and approve accordingly.

sfc-gh-sghosh avatar Nov 09 '25 07:11 sfc-gh-sghosh

@nicornk thank you for the PR. Do you think the existing session.file.put(), then session.read.parquet() work for you?

sfc-gh-yixie avatar Nov 13 '25 06:11 sfc-gh-yixie