pyreadstat
Python package to read SAS, SPSS and Stata files into pandas data frames. It is a wrapper for the C library ReadStat.
I have a Mac and get the following error after trying to compile: `ImportError: dlopen(/Users/fatihshen/Documents/magicstat_venv/lib/python3.10/site-packages/pyreadstat/pyreadstat.cpython-310-darwin.so, 0x0002): tried: '/Users/fatihshen/Documents/magicstat_venv/lib/python3.10/site-packages/pyreadstat/pyreadstat.cpython-310-darwin.so' (mach-o file, but is an incompatible architecture (have 'x86_64', need 'arm64e')), '/usr/lib/pyreadstat.cpython-310-darwin.so' (no...
I have worked on improving the performance of pandas' SAS7BDAT reader over the past couple of days. `pyreadstat` was a great reference for fixing some bugs, thanks a lot for making...
When using `read_file_in_chunks(..., chunksize=C)`, each dataframe chunk has a `RangeIndex(start=0, stop=C, step=1)`. With `pd.read_sas(..., chunksize=C)` each dataframe chunk has a `RangeIndex(start=n*C, stop=(n+1)*C, step=1)` with `n` = current chunk number. I...
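One possible workaround for the index difference described above is to renumber each chunk as it is consumed. The sketch below is not part of pyreadstat: `with_running_index` is a hypothetical helper name, and it assumes the chunks arrive as `(dataframe, metadata)` pairs, as the generator returned by `pyreadstat.read_file_in_chunks` yields them.

```python
def with_running_index(chunks):
    """Yield (df, meta) pairs whose row index continues across chunks.

    `chunks` is any iterable of (dataframe, metadata) pairs. This helper is
    hypothetical and only illustrates the idea; it is not a pyreadstat API.
    """
    offset = 0
    for df, meta in chunks:
        n_rows = len(df)
        # Assigning a range as the index makes pandas build a RangeIndex
        # running from offset to offset + n_rows.
        df.index = range(offset, offset + n_rows)
        offset += n_rows
        yield df, meta


# Usage sketch (the file name is a placeholder):
# import pyreadstat
# reader = pyreadstat.read_file_in_chunks(
#     pyreadstat.read_sas7bdat, "data.sas7bdat", chunksize=10000
# )
# for df, meta in with_running_index(reader):
#     print(df.index)
```

Tracking the offset by the actual length of each chunk, instead of computing `n * C`, keeps the numbering correct when the last chunk is shorter than `chunksize`.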
It would be interesting to be able to read SPSS files with Polars. Pyreadstat provides support for converting to pandas. Could this be done with Polars? Polars is a great...
I'm getting the following error when trying to open a file generated by Pyreadstat in Stata 17 BE-Basic Edition: > This .dta file format was created by Stata/MP and has...
I came across a very strange problem today. When reading an SPSS file, (at least) one of the variables is assigned a different name than it has in the...
When reading and writing SPSS files with long string variables, the respective variable is split into several variables. **Reproducing writing issue:** ``` a = pd.DataFrame() a["LongString1"] = ["Lorem ipsum...
To reproduce: in one cell you need 755 ASCII letters followed by a non-ASCII character, you need...
Dataframes can be easily merged using `pd.merge()`; merging metadata, on the other hand, is a pain. It would be great to have a `pyreadstat.merge_sav()` method...
Support to retrieve (meta) data on MRVs / multiple answer question groupings would be great.