Dzmitry Bahdanau comments

Results 206 comments of


                                            Dzmitry Bahdanau

[WIP] Managing initilizations via a role scheme dictionary

I think the major design decision that I disagree with is having both `initializable_roles` and `parameter_roles`. Neither of the two is fully suitable for controlling how the initialization schemes are...

[WIP] Managing initilizations via a role scheme dictionary

@memimo , we should resume working on this, and I really hope we can finish it until I leave Montreal on June 8.

Graph replacements should act on annotations and aggregations schemes

An addition: same problem for auxiliary variables, they also must be changed. @dwf, I am sort of used to always monitor the training cost on every batch in addition to...

Graph replacements should act on annotations and aggregations schemes

@dmitriy-serdyuk , I am applying replacements after I attached aggregation schemes to some variables, and this does not work. I want to monitor similar expressions on training and vaidation data,...

Graph replacements should act on annotations and aggregations schemes

Thinking about it a bit more, the problem is that annotations are not affected by `ComputationGraph.replace`. That involves their auxiliary variables and updates and might have very nasty consequences, when...

Graph replacements should act on annotations and aggregations schemes

No, #516 is orthogonal. Graph replacement are still not applied to auxiliary variables as they were.

Graph replacements should act on annotations and aggregations schemes

Assuming that you have an aggregation scheme for median, I think it is very possible: ``` python x = tensor.scalar('x') mean_x = aggregation.mean(x) mean_x.name = 'mean_x' median_x = aggregation.median(x) #...

Graph replacements should act on annotations and aggregations schemes

In principle Theano should optimize such repetitions away, and I think it will happen in practice.

Make Parallel bricks play nice with Convolutional bricks

That's probably true that it should not be mandatory for `Fork` to set the `output_dim` attribute of its children. On 4 April 2016 at 02:47, David Warde-Farley [email protected] wrote: >...

Make Parallel bricks play nice with Convolutional bricks

Why not just make it `output_dims` attribute optional for `Parallel` and `Fork`? It can try to set output dimensions if they are given or hope on the brick to compute...