x

Results 5 comments of x

想问训练的时候您用了几个GPU,GPU的内存需要多大?

非常感谢老师的解答

I'm confused of the "activation dimension (L)". Normally, it is the channel dimension. I'm very curious about the activation dimension (L).