BlazeFace_Person.pytorch
Help in training
Hi everyone,
I have been trying to train the model using train_Blazeface.py, but the loss is always NaN for me. Could someone please guide me on this? Thank you.
Hi all, just an update: I was able to get to iteration 320 without NaN by following these steps:
- setting the learning rate to 1e-7
- setting the batch size to 16
- adding the following lines to ssd_model.py:
  - N = num_pos.data.sum().double()
  - loss_l = loss_l.double() / N
  - loss_c = loss_c.double() / N
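One thing that might be worth checking with the normalization above: if a batch has no positive matches at all, N is 0 and the division itself yields NaN or inf. Below is a minimal sketch of a guarded version, assuming num_pos holds the per-image count of matched default boxes (as in typical SSD-style loss code). The helper name normalize_losses is hypothetical and not part of ssd_model.py, and whether an empty batch is actually the cause here is only a guess.

```python
import torch

def normalize_losses(loss_l, loss_c, num_pos):
    """Divide summed losses by the number of positive matches, never by zero.

    Hypothetical helper for illustration; not part of ssd_model.py.
    """
    # Clamp the total to at least 1 so a batch with no matches cannot divide by zero.
    N = num_pos.sum().double().clamp(min=1.0)
    return loss_l.double() / N, loss_c.double() / N

# A batch in which no default box matched any ground-truth box.
loss_l, loss_c = normalize_losses(
    torch.tensor(0.0), torch.tensor(3.5), torch.tensor([0, 0])
)
```

Without the clamp, the localization term in this example would be 0 / 0, which is NaN.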
This gives me valid loss values for the first 320 iterations, but as soon as training goes beyond that, the loss becomes NaN. I have looked up every solution I could find, but I still cannot work out what is going wrong. If someone could give me some leads, I would be very grateful. Looking forward to any direction. Thank you.
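To narrow down where the NaN first appears, one option is to check the loss before every update and to clip gradients so that a single bad batch cannot blow up the weights. The sketch below is a generic PyTorch pattern, not code from train_Blazeface.py: guarded_step, the tiny nn.Linear model, and max_norm=10.0 are all placeholder assumptions for illustration.

```python
import torch
import torch.nn as nn

def guarded_step(model, optimizer, loss, max_norm=10.0):
    """Run one optimizer step; return False if the loss is non-finite and the step was skipped."""
    optimizer.zero_grad()
    if not torch.isfinite(loss):
        # Skip the update so NaN/inf never reaches the weights.
        return False
    loss.backward()
    # Rescale gradients whose total norm exceeds max_norm.
    nn.utils.clip_grad_norm_(model.parameters(), max_norm)
    optimizer.step()
    return True

# Placeholder model and data, only to make the sketch runnable.
model = nn.Linear(4, 1)
optimizer = torch.optim.SGD(model.parameters(), lr=1e-3)
x = torch.randn(8, 4)

ok = guarded_step(model, optimizer, model(x).pow(2).mean())          # finite loss
bad = guarded_step(model, optimizer, model(x).sum() * float("nan"))  # NaN loss
```

Logging the iteration and batch whenever the function returns False would show which inputs trigger the problem. Wrapping a few iterations in torch.autograd.set_detect_anomaly(True) may also help locate the operation that first produces NaN in the backward pass, at the cost of slower training.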