ReadStat
ReadStat copied to clipboard
[Feature Request] Support for import locale-specific encoding strings in SPSS sav
Hi, Thank you for providing such a great tool.
Usually ReadStat can successfully import UTF-encoded datasets, but when reading locally encoded .sav files, it will prompt "Unable to convert string to the requested encoding (invalid byte sequence)"
I provide two sample .sav datasets(remove its .zip suffix) here for testing:
165-locale-specific.savdataset encoded by Chinese locale-specific encoding165-UTF-encoding.savChinese character dataset encoded by UTF 165-locale-specific.sav.zip 165-UTF-encoding.sav.zip
In ReadStat and Haven, the first dataset could not be read with the above error,But GNU PSPP can successfully open it. I guess that when encountering a localized encoded file, try to convert it to UTF through some methods and then read it, but I haven't seen such a practice yet(maybe the practice of PSPP is a success ). Could you consider implementing it.
Thank you!