pyprophet
pyprophet copied to clipboard
Feature/speedup parquet export
Re-implementation of PR https://github.com/PyProphet/pyprophet/pull/111
This uses duckdb for parquet export instead of pandas. This allows for memory and SQL queries to be handled by duckdb which is faster and uses less memory.
Currently, this method only supports a combined output file (no split based on runs) and no IPF
@singjc can you review this when you get a chance?
Looks great, thank you! Please feel free to merge when you think that it is ready.
@singjc suggestions should be addressed now