sgd
sgd copied to clipboard
Flags for constructing design matrix is ignored
- [ ]
subset
: a subset of data points; can be a parameter in sgd.control - [ ]
na.action
: how to deal when data has NA; can be a parameter in sgd.control - [ ]
model
: logical value determining whether to output the X data frame - [ ]
x,y
: logical value determining whether to output the x and/or y - [ ]
contrasts
: a list for performing hypothesis testing on other sets of predictors; can be a paramter in sgd.control - [ ]
weights
- [ ]
offset
I plan to add the default value to na.action
, subset
in valid_model_control()
. However, i'll need to call valid_model_control()
in sgd.formula
. So valid_model_control()
will be called twice. I think the redundancy is acceptable.
na.action
and subset
will be shared by all models, so we may also add them as arguments in sgd.formula
. But I think that would make sgd.formula
look too complex.
Do you think we need na.action
and subset
for sgd.matrix
?
as long as there are reasonable defaults, it would make sense to include those.
It also makes sense to profile the code and see how long running the validity check takes On May 9, 2015 4:22 PM, "Panos Toulis" [email protected] wrote:
as long as there are reasonable defaults, it would make sense to include those.
— Reply to this email directly or view it on GitHub https://github.com/airoldilab/sgd/issues/48#issuecomment-100541105.
I haven't formally profiled the code. But I once ran the code with validity checks commented out. There's no significant improvement in performance.