Li Junjie
Results: 3 issues
I searched for a long time and couldn't figure out how to do this
It should be $F_d = -\log \lVert \nabla_D - \mathbb{E}_x[\log D(x)] - \mathbb{E}_z[\log(1 - D(G(z)))] \rVert$, but the code looks more like an implementation of $F_d = \log \lVert \nabla_D \rVert$. I think it should...
The code uses the `all_gather_with_grad` function to gather tensors and their gradients from all GPUs in order to compute the contrastive loss across GPUs. I can successfully train the BLIP-2...