Jiewen Tan issues

Results 9 issues of


                                            Jiewen Tan

Add the ability to trace TensorList in-place ops

Summary: To trace through c10d::all_gather, AOT needs to support TensorList in-place ops. Companion PyTorch PR: [pytorch#77940](https://github.com/pytorch/pytorch/pull/77806). Test Plan: WIP.

cla signed

Requiring user activation to call WebAuthn API

Unsolicited dialogs or alerts are often disruptive and hated by users. The Level 1 spec didn’t require and foresee that disruptive UI would be shown in response to makeCredential or...

type:technical

stat:breaking

[distributed] Update c10d.new_group signature

Author: @kumpera Summary: A companion change to pytorch/pytorch#84224. Test Plan: CI.

Running tests on XLA:GPU takes forever

It takes forever to run any tests on XLA GPU. And suspicious messages are shown: ``` (pytorch) jwtan@jwtan-v100-4:~/work/pytorch/xla$ MASTER_ADDR=localhost MASTER_PORT=6000 LD_LIBRARY_PATH=/opt/conda/lib/ python test/test_ddp.py TestXrtDistributedDataParallel.test_ddp_correctness Running tests under Python 3.10.6: /opt/conda/envs/pytorch/bin/python3...

triaged

xla:gpu

[DDP] Add a test case to test a larger model

Summary: This commit adds a test case to test a larger model that can trigger multiple all_reduces instead of one. Test Plan: XRT: MASTER_ADDR=localhost MASTER_PORT=6000 python test/test_ddp.py TestXrtDistributedDataParallel.test_ddp_correctness_large_net PJRT: PJRT_DEVICE=TPU...

triaged

xla/test/test_ddp.py is flaky in GPU

xla/test/test_ddp.py is flaky in GPU. Investigate and reenable it.

triaged

ddp

Jiewen Tan

Add the ability to trace TensorList in-place ops

Requiring user activation to call WebAuthn API

[distributed] Update c10d.new_group signature

Running tests on XLA:GPU takes forever

[DDP] Add a test case to test a larger model

xla/test/test_ddp.py is flaky in GPU

functorch.functionalize fails with exponential_ extreme behavior

[Pallas] Introduce make_kernel_from_pallas

Early exit for clip_grad_norm_