R Devon Hjelm
Results: 53 comments of R Devon Hjelm
What is the high-level goal of 2-4?
OK, let's see your PR then. Thanks for working on this!
It's been a while since I've looked at this code, but disconnected_grad is a trick for specifying a loss whose gradient matches the importance-sampled version: the importance weights are treated as constants, so no gradient flows back through them.
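To make the trick concrete, here is a minimal sketch in plain Python. It is not the code under discussion. The toy model (h ~ N(0, 1), x | h ~ N(theta * h, 1)), the N(0, 1) proposal, and every function name are assumptions made for illustration. "Freezing" the weights at the current theta stands in for wrapping them in disconnected_grad: a finite-difference gradient of the surrogate loss with frozen weights should agree with the self-normalized importance-sampled gradient.

```python
import math
import random

LOG_2PI = math.log(2.0 * math.pi)

def log_joint(x, h, theta):
    # Hypothetical toy model: h ~ N(0, 1), x | h ~ N(theta * h, 1).
    log_prior = -0.5 * h * h - 0.5 * LOG_2PI
    log_lik = -0.5 * (x - theta * h) ** 2 - 0.5 * LOG_2PI
    return log_prior + log_lik

def log_proposal(h):
    # Hypothetical proposal q(h) = N(0, 1), independent of theta.
    return -0.5 * h * h - 0.5 * LOG_2PI

def normalized_weights(x, samples, theta):
    # Self-normalized importance weights w_k proportional to p(x, h_k) / q(h_k).
    log_w = [log_joint(x, h, theta) - log_proposal(h) for h in samples]
    m = max(log_w)  # subtract the max for numerical stability
    w = [math.exp(lw - m) for lw in log_w]
    total = sum(w)
    return [wi / total for wi in w]

def surrogate_loss(x, samples, theta, frozen_weights):
    # The weights are passed in as plain numbers, so they do not change
    # when theta is perturbed. This mimics disconnected_grad(weights).
    return -sum(w * log_joint(x, h, theta)
                for w, h in zip(frozen_weights, samples))

def importance_sampled_loss_grad(x, samples, theta):
    # Analytic target: -sum_k w_k * d/dtheta log p(x, h_k; theta),
    # where d/dtheta log p(x, h; theta) = (x - theta * h) * h for this model.
    w = normalized_weights(x, samples, theta)
    return -sum(wi * (x - theta * h) * h for wi, h in zip(w, samples))

random.seed(0)
x, theta, eps = 1.5, 0.7, 1e-4
samples = [random.gauss(0.0, 1.0) for _ in range(20)]

# Compute the weights once at the current theta, then hold them fixed.
frozen = normalized_weights(x, samples, theta)

# Central finite difference of the surrogate loss with frozen weights.
fd_grad = (surrogate_loss(x, samples, theta + eps, frozen)
           - surrogate_loss(x, samples, theta - eps, frozen)) / (2.0 * eps)
target_grad = importance_sampled_loss_grad(x, samples, theta)

print(fd_grad, target_grad)
```

If the weights were recomputed at theta + eps and theta - eps instead of being frozen, the finite difference would also pick up the dependence of the weights on theta, and the result would in general no longer be the importance-sampled gradient. In an autodiff framework, the same effect comes from marking the weights as constants for differentiation rather than from freezing them by hand.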