sgd issues

Convergence diagnostics

1

Ideas - Run chi-squared test sequentially after a batch of iterations to check convergence. This can also be used as a way to stop SGD early rather than running all...

dustinvtran

project

automatic cross validation for parameter tuning

should be as fast as possible. for now do grid search. possible extension: - implementation using bayesian optimization, c.f., ryan adam's work

dustinvtran

feature

AdaGrad vs d-dim

8

Why is the square root in AdaGrad empirically getting better performance? ... or is it? To be analyzed!

dustinvtran

project

sparse matrices support?

3

I want to try `sgd` package - seems it provides a lot of options and must-have features. But why it didn't work with sparse matrices (`Matrix` package, especially `dgCMatrix` class,...

dselivanov

feature

Add argument for sgd.control for specifying different methods of data subsampling

E.g., default is uniform draws, another you can specify a probability vector of dimension N in order to assign weights to do a multinomial draw, another does importance sampling/active learning,...

dustinvtran

project

Randomly initialize at 0 with normally distributed epsilon, say, `eps=1e-5` standard deviation. See #58. To tune hyperparameters: run SGD to get best estimates for a particular choice of hyperparameters. Then...

dustinvtran

feature

restructure and add code for iteratively reweighted least squares

1

dustinvtran

algorithm

project

Plot diagnostics

5

These must allow one to specify multiple sgd objects to plot. - [x] MSE - [x] Classification error - [ ] Evaluation of cost function available x-axis for each of...

dustinvtran

interface

Implement five (5) different modes for setting the learning rate

2

I believe the user should have the following options for the learning rate. - [ ] Manual: Should be possible to set the learning rate manually - [ ] Auto-1dim:...

ptoulis

feature

sgd
sgd copied to clipboard

Metadata

Add SVMs

Convergence diagnostics

automatic cross validation for parameter tuning

AdaGrad vs d-dim

sparse matrices support?

Add argument for sgd.control for specifying different methods of data subsampling

"Warm start"

restructure and add code for iteratively reweighted least squares

Plot diagnostics

Implement five (5) different modes for setting the learning rate

← Metadata

Owner

Metadata

sgd sgd copied to clipboard

Metadata

← Metadata

Owner

Metadata

sgd
sgd copied to clipboard