Results: 9 comments by Fariha

I am using four Tesla T4 GPUs and could only reach a batch size of 2 per GPU for distributed training. All the other hyperparameters are the same...

Thank you so much for such a detailed answer. 1. I am calculating the F1 score for segmentation only; from the code it may seem like I am using boundary and distance...

I tried gradient accumulation, accumulating gradients over 32 samples. Training is still not stable, and I tried a really low learning rate as...
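To make the idea concrete, here is a minimal pure-Python sketch of gradient accumulation. It is not the training code from this thread: the scalar model, the `grad_mse` and `accumulated_grad` helpers, and the toy data are all hypothetical. It only shows that summing micro-batch gradients and rescaling reproduces the gradient of one batch of 32.

```python
import random


def grad_mse(w, batch):
    """Gradient w.r.t. w of the mean of 0.5 * (w * x - y) ** 2 over the batch."""
    return sum((w * x - y) * x for x, y in batch) / len(batch)


def accumulated_grad(w, samples, micro_batch_size):
    """Sum micro-batch gradients, then rescale to the full-batch mean."""
    total = 0.0
    for start in range(0, len(samples), micro_batch_size):
        micro = samples[start:start + micro_batch_size]
        # Weight by micro-batch size so an uneven final micro-batch is handled.
        total += grad_mse(w, micro) * len(micro)
    return total / len(samples)


random.seed(0)
# Hypothetical toy data: 32 samples from y = 3x.
samples = [(x, 3.0 * x) for x in (random.uniform(-1, 1) for _ in range(32))]
w = 0.5

full = grad_mse(w, samples)                               # one batch of 32
accum = accumulated_grad(w, samples, micro_batch_size=2)  # 16 micro-batches of 2
print(abs(full - accum) < 1e-9)
```

Accumulation matches the large-batch gradient, but each forward pass still sees only the micro-batch. If the model uses batch normalization, its statistics are still computed over 2 samples at a time, which may be one reason accumulation alone does not restore large-batch behaviour.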

Here, '#' before a line indicates that the line is commented out.

```python
from mxnet import nd
from models.changedetection.mantis.mantis_dn import *
from mxnet import autograd, optimizer, gluon
import mxnet as mx
import horovod.mxnet as...
```

Thank you once again for such a detailed reply. Okay, so when I calculate the F1 score along the class axis, it is actually good. So for visualising results and data, I...

![img42](https://user-images.githubusercontent.com/8082797/99346115-a6596480-28b5-11eb-88fc-34e5efcfc7eb.png) ![img43](https://user-images.githubusercontent.com/8082797/99346300-21bb1600-28b6-11eb-9bae-4cd3eb15460b.png) These are the results I got on a test image. I used the sliding window method to run inference. The third and fourth images represent ground truth and...
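For readers unfamiliar with sliding-window inference, the sketch below shows one common variant: tile the image with a fixed stride, clamp a final tile to the edge, and average the scores where tiles overlap. It is an assumption about the general technique, not the code used for the images above. `window_starts`, `stitch`, and `constant_model` are hypothetical names, and the "model" is a stand-in that returns a constant score.

```python
def window_starts(length, tile, stride):
    """Offsets at which tiles of size `tile` cover [0, length)."""
    if length < tile:
        raise ValueError("image side is smaller than the tile")
    if not 1 <= stride <= tile:
        raise ValueError("stride must be between 1 and the tile size")
    starts = list(range(0, length - tile + 1, stride))
    # Clamp one extra tile to the far edge if the stride leaves a gap.
    if starts[-1] != length - tile:
        starts.append(length - tile)
    return starts


def stitch(height, width, tile, stride, predict_tile):
    """Average overlapping tile predictions into one full-size score map."""
    total = [[0.0] * width for _ in range(height)]
    count = [[0] * width for _ in range(height)]
    for r in window_starts(height, tile, stride):
        for c in window_starts(width, tile, stride):
            patch = predict_tile(r, c, tile)  # tile x tile nested list of scores
            for i in range(tile):
                for j in range(tile):
                    total[r + i][c + j] += patch[i][j]
                    count[r + i][c + j] += 1
    return [[total[i][j] / count[i][j] for j in range(width)]
            for i in range(height)]


def constant_model(row, col, tile):
    """Stand-in for a real network: every pixel gets a score of 1.0."""
    return [[1.0] * tile for _ in range(tile)]


score_map = stitch(10, 10, tile=4, stride=3, predict_tile=constant_model)
print(window_starts(10, 4, 4))
```

A stride smaller than the tile gives overlapping predictions, which tends to smooth seams at tile borders at the cost of more forward passes.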

Thank you, you helped a lot. Yes, that is true; it needs more training. I wanted to ask: if I lower the image size and then apply the sliding window...

@humayunah The 'none' value comes out when the score for that label does not exist because that class does not exist in the data point. So you will have to decide...
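A small sketch of how such a missing score can arise and one way to handle it, under stated assumptions: `per_class_f1` and `mean_ignoring_none` are hypothetical helpers, not the metric code from this thread, and "absent" here means the class appears in neither the labels nor the predictions of the data point.

```python
def per_class_f1(y_true, y_pred, num_classes):
    """Per-class F1; None where the class appears in neither labels nor predictions."""
    scores = []
    for k in range(num_classes):
        tp = sum(1 for t, p in zip(y_true, y_pred) if t == k and p == k)
        fp = sum(1 for t, p in zip(y_true, y_pred) if t != k and p == k)
        fn = sum(1 for t, p in zip(y_true, y_pred) if t == k and p != k)
        if tp + fp + fn == 0:
            scores.append(None)  # F1 is undefined: 0 / 0
        else:
            scores.append(2 * tp / (2 * tp + fp + fn))
    return scores


def mean_ignoring_none(scores):
    """One possible policy: average only over classes with a defined score."""
    valid = [s for s in scores if s is not None]
    return sum(valid) / len(valid) if valid else None


# Hypothetical data point with three possible classes; class 2 never occurs.
y_true = [0, 0, 1, 1]
y_pred = [0, 1, 1, 1]
scores = per_class_f1(y_true, y_pred, num_classes=3)
print(scores[2] is None)
```

Skipping undefined classes is only one choice. Counting them as 0 or as 1 are alternatives, and each changes the averaged score, so whichever policy is chosen should be applied consistently across data points.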