torchdyn
How to get memory usage for the "adjoint" and "autograd" methods?
Thanks for this amazing package!
I was trying to compare the memory usage of the adjoint method against vanilla "autograd". As claimed by the authors of the original neural ODE paper, the adjoint method should use less memory. However, the output of torch.cuda.memory_summary() showed higher GPU memory usage for the adjoint method than for autograd. I wonder if I used torch.cuda.memory_summary() incorrectly; I printed it after training. If my approach was wrong, what is the correct way to measure memory usage for the "adjoint" and "autograd" methods?
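One possible pitfall: torch.cuda.memory_summary() also counts memory the caching allocator has reserved but not freed back to the driver, so reading it once after training can overstate a run and hide the difference between methods. A sketch of measuring the actual peak tensor allocation instead, using PyTorch's peak-memory counters (assumes a CUDA device; `train_step` is a hypothetical placeholder for your own forward/backward/step closure):

```python
import torch

def peak_memory_mb(train_step, device="cuda"):
    """Run `train_step()` once and return the peak allocated GPU memory in MB.

    Resetting the peak counter before the step and reading
    max_memory_allocated() after it isolates the run's true peak,
    which is what the adjoint-vs-autograd comparison is about.
    """
    torch.cuda.empty_cache()                    # release cached blocks from earlier runs
    torch.cuda.reset_peak_memory_stats(device)  # zero the peak counter
    train_step()                                # forward + backward + optimizer step
    torch.cuda.synchronize(device)              # make sure all kernels finished
    return torch.cuda.max_memory_allocated(device) / 1024**2
```

You would call this twice, once with the model built with sensitivity="adjoint" and once with "autograd", keeping batch size and integration settings identical; the adjoint variant should show a lower peak, since it reconstructs activations during the backward solve instead of storing them.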
Hey, did you happen to make progress on this? I am curious to know and can hopefully provide some benchmarks when I get my problem running as well