Rupesh K Srivastava
This requires converting models from Caffe's NCHW layout to Brainstorm's NHWC layout, so it's not straightforward, but it should still be possible.
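The axis shuffle itself is cheap; here's a minimal NumPy sketch, assuming the target layout on the NHWC side is `(out_channels, kH, kW, in_channels)` (a hypothetical choice — the layout Brainstorm's convolution actually expects should be verified):

```python
import numpy as np

# Caffe stores convolution filters as (out_channels, in_channels, kH, kW).
w_caffe = np.random.randn(64, 3, 5, 5)   # example Caffe weight blob

# Move the channel axis to the end to match an NHWC-style filter layout.
w_nhwc = w_caffe.transpose(0, 2, 3, 1)
assert w_nhwc.shape == (64, 5, 5, 3)
```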
Cool, looking forward to it! NHWC layout makes things like this a bit trickier, but we think it's the better format for the long run. Plus, cuDNN v4 will fully...
Does Keras also use NHWC? We'd like to have a more general approach (full DAG). It's fine to start by handling the simpler cases, with extensibility in mind. Brainstorm also works...
Good point. You're right, the layer implementation actually computes half of the squared difference (and the gradients accordingly), so `SquaredDifference` is a misnomer. We should do something about this. (CC...
The backward pass implementation could simply multiply the deltas by 2, so the gradient check would work fine. Edit: I meant to say, sure, we'll have to modify the backward pass,...
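To make the factor of 2 concrete: for the true squared difference e = (y − t)², we get de/dy = 2(y − t), whereas the current halved version e = ½(y − t)² gives de/dy = (y − t). A quick standalone numeric check (not Brainstorm code):

```python
import numpy as np

y, t = np.array([1.5]), np.array([1.0])
eps = 1e-6

f = lambda y: (y - t) ** 2                        # true squared difference
analytic = 2 * (y - t)                            # d/dy (y - t)^2 = 2(y - t)
numeric = (f(y + eps) - f(y - eps)) / (2 * eps)   # central finite difference
assert np.allclose(analytic, numeric)
```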
Here's a plan for this issue. We'll change the `SquaredDifference` layer so that it computes the correct squared difference. We'll add a new layer (let's call it `SquaredLoss`, subject to change)...
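Roughly, the split would compute the following (a plain-NumPy sketch of the intended math; the function names and signatures are placeholders, not Brainstorm's layer API):

```python
import numpy as np

def squared_difference_forward(x1, x2):
    # Corrected SquaredDifference: the actual squared difference
    return (x1 - x2) ** 2

def squared_difference_backward(x1, x2, out_deltas):
    # d/dx1 (x1 - x2)^2 = 2(x1 - x2); x2 receives the negated gradient
    d = 2 * (x1 - x2) * out_deltas
    return d, -d

def squared_loss_forward(y, t):
    # New loss layer keeps the conventional 1/2 factor,
    # so its gradient w.r.t. y is simply (y - t)
    return 0.5 * (y - t) ** 2

def squared_loss_backward(y, t):
    return y - t
```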
I've made the above change in a private branch. I've named the new layer `SquaredLoss`, but perhaps `SquaredError` or something else would be better? (Caffe calls it `EuclideanLoss`.)
I agree, Euclidean loss is not a name commonly used in the NN literature. `MSE` is. `CE` is a good suffix, but it's also not commonly used in a regression context. A...
I now realize that `EuclideanDistance` would clearly not be a correct name either.
:D Good point. However, I think that `SquaredError` is probably the best name for the new layer, even though it halves the error. The `Error` suffix can act as a...