pyreadstat icon indicating copy to clipboard operation
pyreadstat copied to clipboard

Python package to read sas, spss and stata files into pandas data frames. It is a wrapper for the C library readstat.

Results 27 pyreadstat issues
Sort by recently updated
recently updated
newest added

I have Mac and get the following error after trying to compile: `ImportError: dlopen(/Users/fatihshen/Documents/magicstat_venv/lib/python3.10/site-packages/pyreadstat/pyreadstat.cpython-310-darwin.so, 0x0002): tried: '/Users/fatihshen/Documents/magicstat_venv/lib/python3.10/site-packages/pyreadstat/pyreadstat.cpython-310-darwin.so' (mach-o file, but is an incompatible architecture (have 'x86_64', need 'arm64e')), '/usr/lib/pyreadstat.cpython-310-darwin.so' (no...

I have worked on improving Pandas' SAS7BDAT reader performance in past couple of days. `pyreadstat` was a great source of reference to fix some bugs, thanks a lot for making...

When using `read_file_in_chunks(..., chunksize=C)`, each dataframe chunk has a `RangeIndex(start=0, stop=C, step=1)`. With `pd.read_sas(..., chunksize=C)` each dataframe chunk has a `RangeIndex(start=n*C, stop=(n+1)*C, step=1)` with `n` = current chunk number. I...

enhancement

It would be interesting to be able to read spss files with Polars. Pyreadstat provides support to convert to Pandas. Could this be done with Polars? Polars is a great...

enhancement
solved on dev

I'm getting the following error when trying to open a file generated by Pyreadstat in Stata 17 BE-Basic Edition: > This .dta file format was created by Stata/MP and has...

enhancement
requires changes in Readstat

I have come across a very strange problem today. When reading an SPSS File, (at least) one of the variables is assigned a different name, than it has in the...

bug
requires changes in Readstat

When reading and writing spss files with long string variables, the respective variable is being split into several variables. **Reproducing writing issue:** ``` a = pd.DataFrame() a["LongString1"] = ["Lorem ipsum...

bug
requires changes in Readstat

Describe the issue A clear and concise description of what the issue is. To Reproduce in one cell you need 755 ascii letters followed by a non-ascii character, you need...

bug
requires changes in Readstat

Dataframes can be easily merged using `pd.merge()`; on the other hand, merge metadata is a pain. It would be great if it would be possible to have a `pyreadstat.merge_sav()` method...

enhancement

Support to retrieve (meta) data on MRVs / multiple answer question groupings would be great.

enhancement
requires changes in Readstat