pandera
pandera copied to clipboard
serialization support for Polars and Spark
Hi @cosmicBboy and all contributors to this great framework,
I was wondering if there are any plans to implement schema serialization for the Polars and Spark engines.
Thanks!
there are now! 😀
just relabeld this as an enhancement.
For anyone in the community who wants to implement this, this has my blessing!
Basically they would need to:
- implement
spark_io.pyandpolars_io.pymodules here - rewrite the call site that's currently in the generic DataFrameSchema type: https://github.com/unionai-oss/pandera/blob/main/pandera/api/dataframe/container.py#L1243-L1254
- implement the
to_json,to_yaml,from_json,to_yamlmethods in the pandas/polars/pyspark-specific DataFrameSchema classes.