pytorch
Tensors and Dynamic neural networks in Python with strong GPU acceleration
See https://github.com/pytorch/pytorch/pull/129751#issue-2380881501. Most changes are auto-generated by the linter. You can review these PRs via:

```bash
git diff --ignore-all-space --ignore-blank-lines HEAD~1
```

Stack from [ghstack](https://github.com/ezyang/ghstack) (oldest at bottom):
* __->__ #129762...
Stack from [ghstack](https://github.com/ezyang/ghstack) (oldest at bottom):
* #129802
* __->__ #129801
* #129800
### 🐛 Describe the bug

Hello, I encountered some issues while using `torch.distributed.pipelining`. I tested `PiPPy/examples/huggingface/pippy_gpt2.py` with the default configuration. Because I'm working on full-model testing, I added a...
This PR is auto-generated nightly by [this action](https://github.com/pytorch/pytorch/blob/main/.github/workflows/nightly.yml). Update the pinned audio hash.
## Issue description

> RuntimeError: CUDA error: unspecified launch failure

The error occurs on any training script. Its occurrence is not deterministic and can happen at any point during training. All...
### Approach: Using the current function declaration

**Constraint:** `Q_Heads % KV_Heads == 0`

**Major change:** It adds a meaning to the last third dimension.

**Pros:** This approach covers one major...
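The divisibility constraint above is the usual grouped-query-attention (GQA) head mapping: each KV head serves a contiguous group of `Q_Heads // KV_Heads` query heads. A minimal sketch of that mapping, assuming illustrative names (`kv_head_for_query`, `q_heads`, `kv_heads`) rather than any actual PyTorch API:

```python
# Sketch of the head grouping implied by Q_Heads % KV_Heads == 0.
# Every KV head is shared by a contiguous block of query heads.

def kv_head_for_query(q_head: int, q_heads: int, kv_heads: int) -> int:
    """Return the KV head index that query head `q_head` attends with."""
    if q_heads % kv_heads != 0:
        raise ValueError("Q_Heads must be divisible by KV_Heads")
    group_size = q_heads // kv_heads  # query heads per KV head
    return q_head // group_size

# With 8 query heads and 2 KV heads, query heads 0-3 share KV head 0
# and query heads 4-7 share KV head 1.
print([kv_head_for_query(h, 8, 2) for h in range(8)])  # [0, 0, 0, 0, 1, 1, 1, 1]
```

When `kv_heads == q_heads` this degenerates to standard multi-head attention (one KV head per query head), and `kv_heads == 1` gives multi-query attention.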
### 🐛 Describe the bug

According to the documentation, `torch.distributed.tensor.parallel.SequenceParallel` should shard on the sequence dimension, i.e. `[B, T, C] -> [B, T//_world_size, C]`, but it seems to be tiling...
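The behavior the report expects can be sketched as follows: rank `r` keeps its contiguous `T // world_size` slice of the sequence axis of a `[B, T, C]` tensor. Plain nested lists stand in for torch tensors here; `shard_sequence` is an illustrative name, not the actual `SequenceParallel` implementation.

```python
# Minimal sketch of sequence-dimension sharding: [B, T, C] -> [B, T//world_size, C].

def shard_sequence(x, world_size, rank):
    """Shard a [B, T, C] nested list along the T (sequence) dimension."""
    T = len(x[0])
    assert T % world_size == 0, "sequence length must divide world_size evenly"
    chunk = T // world_size
    start = rank * chunk
    # Each batch element keeps only this rank's slice of timesteps.
    return [batch[start:start + chunk] for batch in x]

# A [1, 4, 2] input sharded across world_size=2:
x = [[[0, 0], [1, 1], [2, 2], [3, 3]]]
print(shard_sequence(x, 2, 0))  # rank 0 gets timesteps 0-1: [[[0, 0], [1, 1]]]
print(shard_sequence(x, 2, 1))  # rank 1 gets timesteps 2-3: [[[2, 2], [3, 3]]]
```

Tiling, by contrast, would give every rank the full `[B, T, C]` tensor (or repeat slices), which is what the report suggests is actually happening.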
In eager mode, FSDP2 pre-allocates the output buffer for AllGather, and the AllGather simply writes into that buffer. Under compile, however, we default to out-of-place AllGather, which means in Traceable FSDP2...
Fixes #95481

Test Plan: Unit-tested `checkpoint_wrapper.py` by instantiating `ActivationWrapper` and got a `TypeError` as expected.

cc @mrshenli @pritamdamania87 @zhaojuanmao @satgera @gqchen @aazzolini @osalpekar @jiayisuse @H-Huang @kwen2501 @awgu @fegin @XilunWu @wanchaol...