Michael Selik
Michael Selik
Not sure what's going on here. ``` In [44]: patsy.dmatrix("C(a)", {'a':['m', 'n', 'o']}) Out[44]: --------------------------------------------------------------------------- TypeError Traceback (most recent call last) in () ----> 1 dmatrix("C(a)", {'a':['m', 'n', 'o']}) /Users/mike/venv/sci/lib/python2.7/site-packages/patsy/highlevel.pyc...
It seems some archaic FDA submission rules require(d) SAS XPT or CPT-format files. The [Aggregate Analysis of ClinicalTrials.gov Database](https://www.ctti-clinicaltrials.org/aact-database) hosts the same data in Oracle "dmp", pipe-delimited text, and SAS...
SAS's publicly available technical paper, _Record Layout for a SAS® Version 8 or 9 Data Set in SAS® Transport Format_ describes handling format and informat names longer than 8 characters:...
Comments in Pandas issue #44866 explain that support for `DataFrame.attrs` and `Series.attrs` is more robust than for `._metadata`. https://github.com/pandas-dev/pandas/issues/44866
The XPORT format specifies that the file is padded with `b' '` to ensure the total file length (in bytes) is a multiple of 80. If there are no numeric...
It should be easy enough to publish the docs website either on GitHub, ReadTheDocs, or somewhere similar.
These sound like good ideas. http://agdr.org/2020/05/14/Polyglot-Makefiles.html https://tech.davis-hansson.com/p/make/
Why not? Let's make this an all-around SAS file reader/writer.
Some XPORT files can be quite large, so it'd be nice to make string decoding faster. I suspect Cython could give us a boost.