large-batch-training
large-batch-training copied to clipboard
pytorch gpu
Is there any problem that makes the implementation of GPU version difficult? I tried to get a linear combination of SB weights and LB weights in GPU mode, and got weird issues. Did you have similar problems before?