Sander Dieleman comments

Results 136 comments of


                                            Sander Dieleman

[December 2014] benchmarking Imagenet winners

Excellent, thanks for the heads up!

[April 2015] Revamp Benchmarks, move to Titan-X (Digits box)

Sweet, looking forward to this! Also slightly jealous ;)

[April 2015] Revamp Benchmarks, move to Titan-X (Digits box)

Awesome work @soumith, thanks for doing this :) Those are some incredible numbers! I knew we had some leeway on Maxwell since almost all the code out there right now...

[April 2015] Revamp Benchmarks, move to Titan-X (Digits box)

That is awesome news! That Python wrapper looks really sweet as well, it should make Theano integration a breeze! I'd be very interested to play around with it :)

how to use dihedral_fast.py？

Here's an example configuration file that uses it: https://github.com/benanne/kaggle-ndsb/blob/master/configurations/convroll_all_broaden_7x7_weightdecay_resume.py#L85

the batch size in the image (2) at run time is different than at build time (10) for the ConvOp

Hi, I will need some more information to be able to help you with this, what was the code that resulted in this error? It would be useful to have...

the batch size in the image (2) at run time is different than at build time (10) for the ConvOp

The implementation follows the formulation in "Biasing RBMs to manipulate latent selectivity and sparsity" by Goh et al, 2010. The sparsity penalty is the _cross entropy_ between the activations and...

the batch size in the image (2) at run time is different than at build time (10) for the ConvOp

That looks like it should work. You can probably drop the term self.sparsity_target**2 since it disappears after taking the gradient anyway. Documentation is a work in progress, I hope to...

the batch size in the image (2) at run time is different than at build time (10) for the ConvOp

'Regular' gaussian units have a mean which is dependent on the input. It is definitely not fixed, else you couldn't really learn much with them :) Regarding fixed variance, there...

the batch size in the image (2) at run time is different than at build time (10) for the ConvOp

> In fact I have to reduce to learning rate greatly, to a value of 0.00001, > for the reconstructions to look reasonable (like digits) at the starting. > May...