Brian Hirsh
When functionalization is turned on in AOT Autograd, we want to hide input mutations in the graph so that the backend compiler doesn't need to worry about seeing `copy_()` ops...
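As a rough sketch of what "hiding the mutation" means here (a hypothetical trace, not the exact graph AOT Autograd produces):

```python
import torch

# User program: mutates one of its inputs in place.
def f(x):
    x.mul_(2)
    return x + 1

# Roughly what a functionalized trace could look like: purely functional ops,
# no copy_() for the backend compiler to see. The mutated input becomes an
# extra graph output instead.
def f_functionalized(x):
    x_updated = torch.mul(x, 2)
    out = torch.add(x_updated, 1)
    return x_updated, out

# The copy back into the original input happens outside the compiled graph,
# in a runtime wrapper/epilogue:
x = torch.ones(3)
x_updated, out = f_functionalized(x)
x.copy_(x_updated)
```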
One reason that functionalization can't be written as a pure graph transform is that its output can depend on the input metadata - specifically whether or not the program inputs...
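A minimal illustration (assuming the metadata in question is whether the inputs alias each other): the same Python program needs different functional graphs depending on how its inputs overlap at runtime:

```python
import torch

def f(a, b):
    a.add_(1)      # mutate a...
    return b + 1   # ...which may or may not be visible through b

base = torch.zeros(4)
f(base[:2], base[2:])   # disjoint views: the mutation of a never affects b
f(base[:3], base[1:])   # overlapping views: the mutation of a is visible through b,
                        # so the functionalized graph has to propagate it
```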
This seems like a minor issue, but the following code breaks:

```python
def foo(x):
    z = torch.zeros(1)  # factory func allocating on cpu
    z.copy_(x)          # cuda_tensor.copy_(cpu_tensor)
    return z.sum()

x = ...
```
Better description coming soon (but this is meant to fix https://github.com/pytorch/pytorch/issues/91093).
tl;dr: this should fix some minor perf regressions that were caused by adding more `as_strided()` calls in AOT Autograd. This PR adds a new context manager, `torch.autograd._set_view_replay_enabled()`. Context: AOT Autograd...
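A minimal usage sketch (the exact call signature here, taking a `True`/`False` flag, is an assumption based on how similar torch toggles work):

```python
import torch

base = torch.ones(4, requires_grad=True)
x = base * 1   # non-leaf tensor, so in-place ops on its views are allowed

# Inside this region, autograd regenerates views by replaying the original
# view op (here, .view()) instead of reconstructing them with as_strided().
with torch.autograd._set_view_replay_enabled(True):
    y = x.view(2, 2)
    y.mul_(2)   # in-place op on a view forces the view to be regenerated

y.sum().backward()
print(base.grad)   # gradient flows back through the replayed view
```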
Still waiting on CI fallout. Fixes #90759.
I spent some time trying to see what it would take to `torch.compile()` a module that uses tensor subclasses, with the torchquant repo as my test example. I have a...
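For context, the kind of tensor subclass in question is a `__torch_dispatch__` wrapper subclass along these lines (a generic sketch, not the actual torchquant implementation):

```python
import torch
from torch.utils._pytree import tree_map

class WrapperTensor(torch.Tensor):
    """Minimal __torch_dispatch__ wrapper subclass (illustrative only)."""

    @staticmethod
    def __new__(cls, elem):
        return torch.Tensor._make_wrapper_subclass(
            cls, elem.shape, dtype=elem.dtype, device=elem.device,
            requires_grad=elem.requires_grad)

    def __init__(self, elem):
        self.elem = elem

    @classmethod
    def __torch_dispatch__(cls, func, types, args=(), kwargs=None):
        kwargs = kwargs or {}

        def unwrap(t):
            return t.elem if isinstance(t, WrapperTensor) else t

        def wrap(t):
            return WrapperTensor(t) if isinstance(t, torch.Tensor) else t

        # Unwrap subclass args, run the underlying op, re-wrap tensor outputs.
        out = func(*tree_map(unwrap, args), **tree_map(unwrap, kwargs))
        return tree_map(wrap, out)
```

Getting `torch.compile()` to trace a module whose parameters or activations are wrapped in a subclass like this is the exercise being described.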
Fixes https://github.com/pytorch/pytorch/issues/119755. We are prototyping whether this situation is going to be exercised more heavily when tracing FSDP. This slightly relaxes the assertion for the case in AOTAutograd when...
Pre-emptive test in OSS to ensure that models relying on the "non-overlapping guards" checks do not suffer drastic guard-evaluation slowdowns. Current plan is to follow up on this with...
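A rough illustration of the scenario being measured (a hypothetical repro, not the actual test models): programs with many tensor inputs where at least one input is mutated, so the compiled artifact has to guard that the inputs don't overlap in memory, and the cost of evaluating those guards grows with the number of inputs:

```python
import torch

# Many tensor inputs plus an input mutation means the compiled artifact has to
# guard that the inputs don't overlap in memory.
@torch.compile
def f(xs):
    xs[0].add_(1)   # input mutation -> aliasing/overlap analysis on the inputs
    total = xs[0].sum()
    for x in xs[1:]:
        total = total + x.sum()
    return total

inputs = [torch.randn(16) for _ in range(256)]  # many inputs -> many guards
f(inputs)   # first call compiles
f(inputs)   # later calls pay the guard-evaluation cost on every invocation
```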