Ray

Results 2 issues of Ray

When i am training the ArbRCAN model, the training is so large that it is a normal situation. Sometime the loss so so large that skip the batch. And i...

I have noticed some parameters in the code, such as kv_downsample diff_routing and so on. What are the functions of the parameters? Howwill they influence the model?