Michael Selik

Results 9 issues of Michael Selik

Not sure what's going on here. ``` In [44]: patsy.dmatrix("C(a)", {'a':['m', 'n', 'o']}) Out[44]: --------------------------------------------------------------------------- TypeError Traceback (most recent call last) in () ----> 1 dmatrix("C(a)", {'a':['m', 'n', 'o']}) /Users/mike/venv/sci/lib/python2.7/site-packages/patsy/highlevel.pyc...

It seems some archaic FDA submission rules require(d) SAS XPT or CPT-format files. The [Aggregate Analysis of ClinicalTrials.gov Database](https://www.ctti-clinicaltrials.org/aact-database) hosts the same data in Oracle "dmp", pipe-delimited text, and SAS...

enhancement

SAS's publicly available technical paper, _Record Layout for a SAS® Version 8 or 9 Data Set in SAS® Transport Format_ describes handling format and informat names longer than 8 characters:...

enhancement

Comments in Pandas issue #44866 explain that support for `DataFrame.attrs` and `Series.attrs` is more robust than for `._metadata`. https://github.com/pandas-dev/pandas/issues/44866

The XPORT format specifies that the file is padded with `b' '` to ensure the total file length (in bytes) is a multiple of 80. If there are no numeric...

bug

It should be easy enough to publish the docs website either on GitHub, ReadTheDocs, or somewhere similar.

enhancement

These sound like good ideas. http://agdr.org/2020/05/14/Polyglot-Makefiles.html https://tech.davis-hansson.com/p/make/

enhancement

Why not? Let's make this an all-around SAS file reader/writer.

enhancement

Some XPORT files can be quite large, so it'd be nice to make string decoding faster. I suspect Cython could give us a boost.

enhancement