functorch
Integrate CUDAGraphs with AOTAutograd
CUDAGraphs should work under NVFuser, so there is no reason not to do it.
There might be some tricky parts: 1. setting it up so that the CUDA graph capture runs after the actual compilation happens, and 2. handling situations where we incur a CUDA sync (like dynamic shape operations).
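To make the first point concrete, here is a rough sketch of what a wrapper could look like. It is not the code from any PR: `cudagraphify` and `warmup_iters` are hypothetical names, and it assumes the wrapped function is already compiled, takes only tensors, returns a single tensor, and is always called with the same shapes and dtypes. It uses only the public `torch.cuda.CUDAGraph` / `torch.cuda.graph` API.

```python
import torch


def cudagraphify(fn, example_inputs, warmup_iters=3):
    """Wrap `fn` so that repeat calls replay one captured CUDA graph.

    Hypothetical helper for illustration only. Assumes `fn` takes tensors,
    returns a single tensor, and always sees the shapes in `example_inputs`.
    """
    if not torch.cuda.is_available():
        return fn  # no CUDA device: just call fn directly

    # A graph replays against fixed memory addresses, so keep static input
    # buffers and copy fresh data into them before every replay.
    static_inputs = [x.clone() for x in example_inputs]

    # Warm up on a side stream so that lazy initialization (and any
    # one-time compilation triggered by the first calls) finishes
    # before capture starts.
    side = torch.cuda.Stream()
    side.wait_stream(torch.cuda.current_stream())
    with torch.cuda.stream(side):
        for _ in range(warmup_iters):
            fn(*static_inputs)
    torch.cuda.current_stream().wait_stream(side)

    # Capture a single call of fn into the graph.
    graph = torch.cuda.CUDAGraph()
    with torch.cuda.graph(graph):
        static_output = fn(*static_inputs)

    def run(*inputs):
        for dst, src in zip(static_inputs, inputs):
            dst.copy_(src)
        graph.replay()
        # Clone so the caller's result is not overwritten by the next replay.
        return static_output.clone()

    return run
```

The warm-up loop is where "run after compilation" shows up: anything that compiles lazily on the first call has to finish before capture begins. The second point is not handled here at all. As far as I understand, an operation that forces a CUDA sync during capture makes the capture fail, so a real integration would need to detect such ops and fall back to eager execution for them.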
cc: @eellison (this issue could also go on the TorchDynamo repo)
https://github.com/pytorch/pytorch/pull/80566 implements this