Evan Miller
Evan Miller
This is not currently possible with ReadStat. (For my reference) custom attributes are described in: https://www.gnu.org/software/pspp/pspp-dev/html_node/Data-File-and-Variable-Attributes-Records.html#Data-File-and-Variable-Attributes-Records
ReadStat 1.1.0 includes a new offset/limit API to facilitate chunking: https://github.com/WizardMac/ReadStat/blob/master/src/readstat.h#L377 ```C readstat_error_t readstat_set_row_limit(readstat_parser_t *parser, long row_limit); readstat_error_t readstat_set_row_offset(readstat_parser_t *parser, long row_offset); ```
What is your output format? SAV, DTA, etc
Any ideas, @ivarref?
Just FYI what you are calling the "code point" representation is actually Latin-1, see https://en.wikipedia.org/wiki/ISO/IEC_8859-1 Is SPSS producing files containing both UTF-8 and Latin-1 data?
@pdbailey0 If you download the standalone `readstat` utility, it will report the file's self-reported encoding. ```bash $ readstat binlfp2.sav Format: SPSS binary file (SAV) Columns: 8 Rows: 753 Table label:...
Thanks, I'll look at this later today.
These look like two separate errors. I realize the error messages aren't especially helpful. The first error is a shortcoming in `extract_metadata`, which only recognizes one Stata date format (`%td`)....
> Is there a chance to include more flags to the `readstat` CLI to support encoding? Because if `readstat` cannot open certain files, I am not exactly sure what use...
> Currently, I'm the only developer working on this project, so I would be happy to try implementing this flag on readstat if you point me where to start. Sure,...