VasundharaAgarwal
> Using a server learning rate = 1 should be just as good as anything else. I believe that's what we observed. Thanks! Should I use that value for all...
> My impression is that yes, this should work well for all noise multipliers and clipping quantiles. I will check with the paper authors and get back to you on...
Thank you so much @galenmandrew, that's very helpful. I'm not sure I understand how a learning rate of 0.0 for EMNIST-CR adaptive would work. Surely the model won't get updated?