opensphere
Why use no_grad for computing d_theta?
Hello, I found that this code disables gradients for g_cos_theta and the angular margin: https://github.com/ydwen/opensphere/blob/main/model/head/sphereface2.py#L62-L80
Won't this cause the network to oscillate?
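
To make the question concrete, here is a minimal sketch of the pattern as I understand it. It is not copied from the repository: the helper name `margin_logits`, the particular form of `g_cos_theta`, and the values of `s`, `m`, and `t` are my own assumptions for illustration. The point is that `d_theta` is computed inside `torch.no_grad()`, so the forward value of the logits includes the margin, but the backward pass only sees `cos_theta`.

```python
import torch


def margin_logits(cos_theta, one_hot, s=30.0, m=0.4, t=3.0):
    """Hypothetical helper (not the repository's function).

    Adds a margin term to cos_theta, with the margin offset d_theta
    computed under no_grad so it is treated as a constant in backward.
    """
    with torch.no_grad():
        # Assumed form of the margin function, for illustration only.
        g_cos_theta = 2.0 * ((cos_theta + 1.0) / 2.0).pow(t) - 1.0
        g_cos_theta = g_cos_theta - m * (2.0 * one_hot - 1.0)
        # Offset between the margin-adjusted value and the raw cosine.
        d_theta = g_cos_theta - cos_theta
    # Forward value is s * g_cos_theta; gradient w.r.t. cos_theta is just s.
    return s * (cos_theta + d_theta)


# Toy input: one sample, two classes, target class is index 0.
cos_theta = torch.tensor([[0.5, -0.2]], requires_grad=True)
one_hot = torch.tensor([[1.0, 0.0]])

logits = margin_logits(cos_theta, one_hot)
logits.sum().backward()
grad = cos_theta.grad  # every entry equals s, independent of m and t
```

In this sketch the gradient with respect to `cos_theta` is the constant `s` for every entry, so neither `m` nor `t` affects the backward pass. My question is whether that mismatch between the forward and backward computation is intentional.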