Results: 2 comments by Wancong (Kevin) Zhang
Has anyone found a solution to this bug? I also encountered it.
Another difference is that `clip_grad_norm` and `clip_model_grad_norm` are both set to -1 in [sgim_finetune.sh](https://github.com/mila-iqia/SGI/blob/master/scripts/experiments/sgim_finetune.sh), whereas in the SPR paper you clipped the gradients to 10.
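To make the difference concrete, here is a minimal pure-Python sketch of gradient clipping. It is not the SGI or SPR code. It rests on two assumptions: that a value of -1 means "clipping disabled" in the script, and that "clipped to 10" means rescaling by the global L2 norm. The function name `clip_by_global_norm` is hypothetical.

```python
import math


def clip_by_global_norm(grads, max_norm):
    """Rescale a flat list of gradient values so their L2 norm is at most max_norm.

    Assumption: a non-positive max_norm (such as -1) means "do not clip".
    """
    if max_norm <= 0:
        # Clipping disabled: return the gradients unchanged.
        return list(grads)
    total_norm = math.sqrt(sum(g * g for g in grads))
    if total_norm <= max_norm:
        # Already within the limit, nothing to do.
        return list(grads)
    scale = max_norm / total_norm
    return [g * scale for g in grads]


grads = [30.0, 40.0]  # global L2 norm is 50

unclipped = clip_by_global_norm(grads, -1)  # assumed equivalent of the -1 setting
clipped = clip_by_global_norm(grads, 10)    # assumed equivalent of clipping to 10

print(unclipped)
print(clipped)
```

If those assumptions hold, large gradients pass through at full size with the -1 setting, while a limit of 10 shrinks them to norm 10. That could plausibly change training behavior.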