torchexplorer icon indicating copy to clipboard operation
torchexplorer copied to clipboard

Add option for whether or not to enable gradients on the root module inputs

Open spfrommer opened this issue 1 year ago • 0 comments

Right now, in order to ensure that all submodules have input gradients / output gradients, a dummy tensor is added to the inputs of the watch'd module. This adds new computational overhead since now gradients are computed to the input. This should be optional.

spfrommer avatar Nov 15 '23 01:11 spfrommer