torchexplorer
torchexplorer copied to clipboard
Add option for whether or not to enable gradients on the root module inputs
Right now, in order to ensure that all submodules have input gradients / output gradients, a dummy tensor is added to the inputs of the watch
'd module. This adds new computational overhead since now gradients are computed to the input. This should be optional.