
fix dropout = 1.0 issue. If dropout = 1.0, it should not run dropout …

Open mpjlu opened this issue 6 years ago • 11 comments

TensorFlow's Python dropout op uses the following code to check the keep_prob value:

```python
if tensor_util.constant_value(keep_prob) == 1:
    return x
```

If keep_prob is a placeholder, tensor_util.constant_value(keep_prob) returns None, so the if statement is always false and the dropout computation runs even when keep_prob is fed as 1.0.
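To make the failure mode concrete, here is a minimal sketch (TensorFlow 1.x; the tensor shapes are arbitrary) contrasting a constant keep_prob with a placeholder:

```python
import tensorflow as tf
from tensorflow.python.framework import tensor_util

x = tf.ones([4, 4])

# Python-constant keep_prob: constant_value(...) == 1 is True, so
# tf.nn.dropout returns x unchanged and adds no dropout ops to the graph.
y_const = tf.nn.dropout(x, keep_prob=tf.constant(1.0))

# Placeholder keep_prob: constant_value(...) is None, so the check is
# False and the full dropout computation is built into the graph --
# it still runs at inference time even if we feed 1.0.
keep_prob = tf.placeholder(tf.float32)
y_ph = tf.nn.dropout(x, keep_prob=keep_prob)

print(tensor_util.constant_value(tf.constant(1.0)))  # 1.0 -> short-circuit
print(tensor_util.constant_value(keep_prob))         # None -> no short-circuit
```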

mpjlu avatar Jul 30 '18 07:07 mpjlu

Thanks for your contribution. I see why this is better during training. But how should we control the dropout during validation and prediction? There we want to set the dropout to 1. Or am I missing something?

jakeret avatar Jul 31 '18 06:07 jakeret

For prediction, we don't need dropout. If dropout is set to 1, the right behavior is for the dropout layer to return its input directly.

mpjlu avatar Aug 02 '18 02:08 mpjlu

Right. So during training we want dropout to be < 1 and during validation it should be = 1. How can we control this?

jakeret avatar Aug 02 '18 10:08 jakeret

We can create two Unets with different keep_prob values, one for training and one for validation (see the sketch below). What do you think? Since the dropout layer is quite time-consuming, it is better to skip it during validation and inference.
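As a sketch of the idea: the released Unet constructor creates keep_prob as a placeholder internally, so the keep_prob constructor argument below is the proposed change, not the existing API. Each net would also need its own tf.Graph (or variable scope) to avoid variable-name collisions.

```python
from tf_unet import unet

# Proposed (hypothetical) keep_prob argument: bake the value in as a
# Python constant so tf.nn.dropout can short-circuit when it equals 1.0.
train_net = unet.Unet(channels=1, n_class=2, layers=3, features_root=16,
                      keep_prob=0.75)   # dropout active during training
val_net = unet.Unet(channels=1, n_class=2, layers=3, features_root=16,
                    keep_prob=1.0)      # dropout skipped entirely
```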

mpjlu avatar Aug 03 '18 01:08 mpjlu

Don't we have to train two models then? I wasn't aware that dropout is so time-consuming. How much does it affect training/validation performance?

jakeret avatar Aug 03 '18 06:08 jakeret

For inference, dropout accounts for about 16% of the iteration time (second row of the screenshot below). We don't need to train two models; we just need to create a new model (one with keep_prob = 1) for inference/validation.

[profiling screenshot]

mpjlu avatar Aug 06 '18 07:08 mpjlu

Hi @jakeret, any comments on the data? The numbers are measured on CPU.

mpjlu avatar Aug 15 '18 02:08 mpjlu

A 16% performance improvement is nice. However, I still don't fully understand what the training/validation procedure would look like. If a new model is created for validation, how would you transfer the learned weights?

jakeret avatar Aug 16 '18 12:08 jakeret

Sorry for the late reply.
How about passing two nets when creating the Trainer object: a train_net for training and a validation_net for validation (sketched below)? The train_net can save the model after each epoch, and the validation_net can restore it for validation. What do you think?
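For illustration only, reusing the train_net/val_net from the sketch above; the released unet.Trainer accepts a single net, so the validation_net keyword here is the proposed extension, not the existing API:

```python
from tf_unet import unet, image_util

data_provider = image_util.ImageDataProvider("data/*.tif")

# Hypothetical two-net Trainer: train with train_net, checkpoint each
# epoch, and let val_net restore the checkpoint for validation.
trainer = unet.Trainer(train_net, validation_net=val_net, optimizer="momentum")
path = trainer.train(data_provider, "./unet_trained",
                     training_iters=32, epochs=10)
```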

mpjlu avatar Sep 18 '18 05:09 mpjlu

I don't see how this should be implemented. The computation graph would be different for the two networks, which makes it hard to transfer the weights from one to the other.

jakeret avatar Sep 19 '18 14:09 jakeret

The dropout layer has no weights, so it is fine to save the model from the train net and restore it in the validation net.
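A minimal sketch of that handoff, assuming a hypothetical build_net(keep_prob) that creates the same trainable variables in both graphs (dropout contributes none, so the checkpoints line up):

```python
import tensorflow as tf

# Training graph: keep_prob < 1, dropout ops present.
train_graph = tf.Graph()
with train_graph.as_default():
    build_net(keep_prob=0.75)            # hypothetical model builder
    saver = tf.train.Saver()
    with tf.Session() as sess:
        sess.run(tf.global_variables_initializer())
        # ... run one training epoch ...
        saver.save(sess, "./model.ckpt")

# Validation graph: keep_prob == 1, dropout short-circuited away.
val_graph = tf.Graph()
with val_graph.as_default():
    build_net(keep_prob=1.0)             # same variables, no dropout ops
    saver = tf.train.Saver()
    with tf.Session() as sess:
        saver.restore(sess, "./model.ckpt")
        # ... run validation ...
```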

mpjlu avatar Sep 20 '18 12:09 mpjlu