Andreas Mueller

Results 411 comments of Andreas Mueller

It would be great to have a work-around for this, I'd really like to use this dataset.

I think I had spaces after the comma, that might have been the issue. Thank you! Version two is my fork IIRC :)

FYI it seems that if you fork a dataset, it keeps the owner by default. I'm not sure if that's intentional?

Hm ok so this is the last person that edited it? Because 45705 was the one I created and it's now "uploaded" by you.

can you try to reproduce it with the command line interface? Otherwise it might be numerical issues caused by us (sklearn). Also, how about scaling your data ;)

Any update? Adding any CC license for clarity would be great, ideally CC0 as mentioned above.

The SGD in scikit-learn actually has an adaptive learning rate - it can even be set to be the same as pegasos, I believe. For the projection step, the claims...

After looking it up again, I think you need to set `power_t=1` to get the pegasos schedule.

Wow that looks quite good. I'm quite surprised your implementation is significantly faster than sklearn. Do you have any idea where that could come from? Also, could you please share...

You say that training on random samples makes it had to compare speed.s How so? One iteration of sgd are `n_samples` many updates, which you should compare against `n_samples` many...