bitgs
Results
1
issues of
bitgs
This gradient cliping sentence seems wrong. clip_grad_var = [gv if gv[0] is None else [tf.clip_by_norm(gv[0], 100.), gv[1]] for gv in gvs] when gv[0]==0, gv[0] should be clipped. But here, you...