Li Junjie issues

Repositories
Issues
Comments

Results 3 issues of


                                            Li Junjie

评论里代码框怎么弄出来

找了半天没找到怎么整

Is the implementation of Fd incorrect?

It should be Fd = − log||∇D − Ex[log D(x)] − Ez[log(1 − D(G(z)))]||. But in terms of code, it's more like implementing Fd = log||∇D||. I think it should...

Loss calculation across GPUs using all_gather_with_grad function

The code uses the `all_gather_with_grad` function to collect the tensor and gradient on all GPUs in order to compute the comparison loss across GPUs. I can successfully train the BLIP-2...