He Ma
He Ma
I just made some changes regarding Cifar10_model and Wide_ResNet. You may want to pull it from master. As for your hanging problem, I would recommend debugging it from [here](https://github.com/uoguelph-mlrg/Theano-MPI/blob/master/theanompi/worker.py#L94). Put...
@Nqabz The memory allocation part looks weird to me. I don't have this configured anywhere ( like cnmem in .theanorc) and I don't see this in my standard output and...
@deepali-c The changes for making the single GPU train.py working would involve changing any `sandbox.cuda` functions to `gpuarray` alternatives, using device='cuda0' instead of device='gpu0', and moving any `import theano` after...
@deepali-c The error looks like something with `floatX`. Anyways, I just created a [pygpu branch](https://github.com/uoguelph-mlrg/theano_alexnet/tree/pygpu). And the single GPU `train.py` is working. You can compare your patch with [this commit](https://github.com/uoguelph-mlrg/theano_alexnet/commit/5eba9808ec13820fe7a917b869c8050a32950b82)...
@deepali-c I just made the `train_2gpu.py` working based on pygpu collectives, which is then based on NCCL. So you need to install [NCCL](https://github.com/NVIDIA/nccl), [libgpuarray](http://deeplearning.net/software/libgpuarray/installation.html#step-by-step-install-user-library) and its wrapper pygpu in order...
@deepali-c Sorry, I forgot to debug the validation part. See the [last commit](https://github.com/uoguelph-mlrg/theano_alexnet/commit/31222da3f5ba7ec4b3bc78a8ae5a95ebf80aaa77#diff-238fca2c2416136f03efe7a9d712964dL356) regarding this issue. The `exch.exchange()` is for exchanging the `total_params`. Here what we need is to average...
@G33kyKitty Theano version **=0.7~0.9** for the master branch, **>0.9** for the pygpu branch. For other dependencies, just using the **latest** version will be fine I think. But I haven't tested...
I just finished one testing of the RGB regularization. The result shows that when using it in training and not using it in validation, validation error 43.575721 % top 5...
@gwding Sorry. I just checked the job. It seems the RGB augmentation doesn't improve much. Using alpha=0.1 even makes the error higher: validation error 55.723157 % top 5 validation error...
@aryanbhardwaj Which Theano backend are you using? Maybe you want to try the [pygpu branch](https://github.com/uoguelph-mlrg/theano_alexnet/tree/pygpu) to use the new Theano backend. There are some [commits](https://github.com/uoguelph-mlrg/theano_alexnet/commit/5eba9808ec13820fe7a917b869c8050a32950b82#diff-07373eb9d5c32041ce70704f1e321cb8R21) addressing this int conversion in...