ReadStat
ReadStat copied to clipboard
Feature request: Add support for writing output directly to Parquet files
For example using https://github.com/apache/parquet-cpp .
This would be a great feature to add. I'm guessing that this would be a difficult addition, however, since based on memory usage it seems like Readstat iterates over rows a few at a time. In order to export to Parquet (since it's a column-oriented format) you'd have to import many more rows at once in order to transpose from rows to columns.
Additionally, the above link is outdated. Parquet-cpp is now included within the Arrow repository: https://github.com/apache/arrow