Brian Hirsh
This is mostly to generate discussion (maybe we want to land this? But the state of caching here feels pretty fragile). Partial fix to the issue here: https://fb.workplace.com/groups/1075192433118967/permalink/1381371379167736/ It looks like we...
Fixes https://github.com/pytorch/pytorch/issues/116433. Putting this out as a tentative fix, but more discussion is in the GitHub issue.

Stack from [ghstack](https://github.com/ezyang/ghstack) (oldest at bottom):
* __->__ #116435
Partial fix for https://github.com/pytorch/pytorch/issues/120424. @int3 to continue investigation.

Stack from [ghstack](https://github.com/ezyang/ghstack) (oldest at bottom):
* __->__ #120427

cc @voznesenskym @penguinwu @EikanWang @jgong5 @Guobing-Chen @XiaobingSuper @zhuhaozhe @blzheng @wenzhe-nrv @jiayisunx @peterbell10 @ipiszy...
Fixes https://github.com/pytorch/pytorch/issues/125287. Fixes https://github.com/pytorch/pytorch/issues/124090; more context is on the issue.

Stack from [ghstack](https://github.com/ezyang/ghstack) (oldest at bottom):
* #124400
* __->__ #124399
* #124398

cc @mrshenli @pritamdamania87 @zhaojuanmao @satgera @rohan-varma @gqchen @aazzolini @osalpekar...
Fixes https://github.com/pytorch/pytorch/issues/124397.

Stack from [ghstack](https://github.com/ezyang/ghstack) (oldest at bottom):
* __->__ #124400
* #124399
* #124398
This came up in the FSDP2 workstream, which needs a DTensor that holds some sort of float8 tensor (cc @Chillee @ezyang @zou3519 @albanD @samdow @msaroufim @anijain2305 @chauhang @awgu / @drisspg...
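For context, here is a minimal sketch of the nesting pattern involved: an outer wrapper tensor subclass that holds an inner float8 tensor and unwraps it in `__torch_dispatch__`. `Float8Holder` is a hypothetical name for illustration only; the actual DTensor and Float8Tensor classes are much more involved.

```python
import torch
from torch.utils._pytree import tree_map

# Hypothetical wrapper subclass, only to illustrate "a subclass holding a
# float8 inner tensor"; not the real DTensor/Float8Tensor implementation.
class Float8Holder(torch.Tensor):
    @staticmethod
    def __new__(cls, inner):
        # The outer wrapper advertises fp32 while the inner data is float8.
        return torch.Tensor._make_wrapper_subclass(
            cls, inner.shape, dtype=torch.float32, device=inner.device
        )

    def __init__(self, inner):
        self._inner = inner

    @classmethod
    def __torch_dispatch__(cls, func, types, args=(), kwargs=None):
        kwargs = kwargs or {}
        # Unwrap any Float8Holder args, upcasting the inner float8 data.
        unwrap = lambda t: (
            t._inner.to(torch.float32) if isinstance(t, Float8Holder) else t
        )
        args, kwargs = tree_map(unwrap, (args, kwargs))
        return func(*args, **kwargs)

inner = torch.randn(4).to(torch.float8_e4m3fn)
wrapped = Float8Holder(inner)
print(wrapped + wrapped)  # dispatches through the wrapper, computes in fp32
```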
Fixes an error for torchtitan + internal.

Stack from [ghstack](https://github.com/ezyang/ghstack) (oldest at bottom):
* #124400
* #124399
* __->__ #124398

cc @voznesenskym @penguinwu @EikanWang @jgong5 @Guobing-Chen @XiaobingSuper @zhuhaozhe @blzheng @wenzhe-nrv...
Re-land of https://github.com/pytorch/pytorch/pull/123347. The original PR broke internal builds because of a circular import caused by importing dynamo in the DTensor code. The new version uses `torch._dynamo_disable` to work around it. Stack...
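For reference, a minimal sketch of the disable-dynamo pattern, written here with the public `torch._dynamo.disable` decorator (per the PR description, the DTensor code uses a variant that avoids importing dynamo at module import time, which is what breaks the circular import):

```python
import torch
import torch._dynamo

# Sketch only: mark a helper so dynamo skips tracing its body.
@torch._dynamo.disable
def helper(x):
    return x + 1

@torch.compile(backend="eager")
def f(x):
    # dynamo traces f but graph-breaks around the disabled helper
    return helper(x) * 2

print(f(torch.ones(2)))
```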
Example repro:
```
import torch

def f(x):
    y = x + 1
    z = torch.nn.Parameter(y)
    with torch.no_grad():
        z.mul_(2)
    return y + z

x = torch.ones(2, requires_grad=True)
out_ref = f(x)
out_test...
```
More details further down, but first, a higher-level description of "how do we functionalize storage resizing?" Today, dynamo converts `param.untyped_storage().resize_(x)` calls that it sees from FSDP into a custom...
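For context, the eager-mode pattern being functionalized looks roughly like this (a sketch of how FSDP frees and later restores a parameter's backing storage; the sizes and names are illustrative):

```python
import torch

p = torch.nn.Parameter(torch.randn(4))
nbytes = p.untyped_storage().nbytes()

# Free the parameter's backing memory (as FSDP does when the unsharded
# parameter is not currently needed). The tensor metadata stays intact,
# but its data can no longer be accessed.
p.untyped_storage().resize_(0)

# ...later, reallocate the storage before the parameter is used again.
# The memory is uninitialized until FSDP copies the gathered data back in.
p.untyped_storage().resize_(nbytes)
```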