Samantha Andow comments

Results 39 comments of


Samantha Andow

No differentiated Tensors in the graph when using autograd.grad with functorch

The error: `One of the differentiated Tensors appears to not have been used in the graph. Set allow_unused=True if this is the desired behavior.` means that the derivative with respect...

Multiple gradient calculation for single sample

> When replacing jacrev with jacfwd, the following error occurs ... Ahh sorry that's my fault, I'll put up a patch to fix that today. In the meantime, if you...

Multiple gradient calculation for single sample

To check @AlphaBetaGamma96's intuition that it might just be an OOM issue, I know you're able to compute the forward pass but are you able to compute just gradients on...

Multiple gradient calculation for single sample

@JoaoLages Sorry for the delay in response, do you have an E2E repro that you could share? We're trying to understand if it's going to be better to recommend using...

memory_efficient_fusion leads to RuntimeError for higher-order gradients calculation. RuntimeError: You are attempting to call Tensor.requires_grad_()

cc @Chillee @anijain2305 Any thoughts? In particular re: why memory_efficient_fusion made the the final case slower

Samantha Andow

No differentiated Tensors in the graph when using autograd.grad with functorch

Multiple gradient calculation for single sample

Multiple gradient calculation for single sample

Multiple gradient calculation for single sample

memory_efficient_fusion leads to RuntimeError for higher-order gradients calculation. RuntimeError: You are attempting to call Tensor.requires_grad_()

Get wrong jacobian from copyslice operation

Possible (-2 to 4%) regression in functorch_dp_cifar10_cuda model from 0.1.1 to latest

Possible (-2 to 4%) regression in functorch_dp_cifar10_cuda model from 0.1.1 to latest

Simultaneous computation of per-sample gradient and per-batch gradient