PiPPy
Pipeline Parallelism for PyTorch
Hi, I’m using PiPPy for PP+DP. I ran the following code: [pytorch/tau/blob/main/examples/ddp2pipe/ddp2pipe.py](https://github.com/pytorch/tau/blob/main/examples/ddp2pipe/ddp2pipe.py). I set DIMS, PP LAYERS, and DP LAYERS like this: DIMS = [28 * 28, 300, 100, 30,...
This error is thrown when using torch.nn.CrossEntropyLoss() with SPMD API.
Currently the MNIST benchmark fails due to unsupported convolution ops in the DTensor registry. Error: NotImplementedError: Operator aten.convolution.default does not have a DistributedTensor rule registered.
PRs for tau are failing due to an unrelated missing rule for DTensor: "Operator aten.fill.Scalar does not have a DistributedTensor rule registered." Details: Traceback (most recent call last): File "/__w/tau/tau/test/spmd/tensor/test_dtensor_ops.py",...
### What is the issue: PiPPy's HF model [inference](https://github.com/pytorch/tau/tree/main/examples/inference) examples use the FX tracer under the hood. Seq2Seq models such as T5, or decoder models such as OPT and BLOOM, that...
Subtask of https://github.com/pytorch/PiPPy/issues/299
Currently we run fusion based on an integer policy, where the integer maps to the total number of comm calls to fuse. Need to add a bucket-size policy handler to set up...
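A bucket-size policy could look like the sketch below: instead of fusing a fixed count of comm calls, accumulate consecutive calls until a byte budget is reached, then start a new bucket. This is a hypothetical plain-Python illustration; `CommCall` and `bucket_by_size` are made-up names, not part of the PiPPy API.

```python
from dataclasses import dataclass
from typing import List

@dataclass
class CommCall:
    """Stand-in for one comm call (e.g. an all-reduce) captured in the graph."""
    name: str
    nbytes: int  # payload size of the gradient being communicated

def bucket_by_size(calls: List[CommCall], bucket_cap_bytes: int) -> List[List[CommCall]]:
    """Greedily pack consecutive comm calls into buckets of at most bucket_cap_bytes.

    A single call larger than the cap still gets its own bucket, since it
    cannot be split here.
    """
    buckets: List[List[CommCall]] = []
    current: List[CommCall] = []
    current_bytes = 0
    for call in calls:
        if current and current_bytes + call.nbytes > bucket_cap_bytes:
            buckets.append(current)
            current, current_bytes = [], 0
        current.append(call)
        current_bytes += call.nbytes
    if current:
        buckets.append(current)
    return buckets
```

This mirrors the greedy bucketing DDP uses for its `bucket_cap_mb` setting: order is preserved so that gradients ready earliest can be flushed earliest.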
Currently we assume all comm calls can be fused with any other comm call (i.e. all use the default process group). This is usually correct, but we need to implement a check of...
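The check could amount to partitioning comm calls by their process group before any bucketing, since only calls on the same group can legally share a fused collective. A minimal hypothetical sketch (`partition_by_group` is illustrative, not PiPPy code):

```python
from collections import defaultdict
from typing import Dict, Iterable, List, Tuple

def partition_by_group(calls: Iterable[Tuple[str, str]]) -> Dict[str, List[str]]:
    """Split comm calls into per-process-group lists, preserving order.

    `calls` is an iterable of (call_name, process_group_id) pairs; each
    returned list can then be bucketed and fused independently.
    """
    by_group: Dict[str, List[str]] = defaultdict(list)
    for name, group in calls:
        by_group[group].append(name)
    return dict(by_group)
```

Fusing across groups would produce a single collective with mismatched participants, so the safe default is to fuse only within each partition.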
Currently we default to FP32 for the fusion buffer, but that is not correct for mixed-precision cases. Thus, we need to check the shape-prop metadata and build the buffer with the correct dtype.
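One way to sketch the dtype check: read the dtype recorded per tensor (e.g. from shape-prop metadata) and require that every gradient sharing a buffer agrees on it; under mixed precision, disagreeing tensors must go into separate buffers. A hypothetical helper, using dtype strings in place of real `torch.dtype` objects:

```python
from typing import Iterable

def buffer_dtype(tensor_dtypes: Iterable[str]) -> str:
    """Return the single dtype for one fusion buffer.

    Raises if the bucketed tensors disagree on dtype, signalling that the
    bucketing pass should have split them into per-dtype buffers instead of
    silently upcasting everything to FP32.
    """
    dtypes = set(tensor_dtypes)
    if len(dtypes) != 1:
        raise ValueError(
            f"cannot share one fusion buffer across dtypes: {sorted(dtypes)}"
        )
    return dtypes.pop()
```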
After PR https://github.com/pytorch/tau/pull/631 lands, add unit testing. Simple tests would involve running fusion under a set policy, verifying output gradients, and inspecting the graph.
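The gradient-verification part of such a test can be illustrated without torch: fused communication (flatten all gradients into one buffer, reduce once, split back) must produce exactly the same values as reducing each gradient separately. A hypothetical plain-Python stand-in:

```python
from typing import List

def allreduce(xs_per_rank: List[List[float]]) -> List[float]:
    """Stand-in for an all-reduce: element-wise sum across 'ranks'."""
    return [sum(vals) for vals in zip(*xs_per_rank)]

def fused_allreduce(grads_per_rank: List[List[List[float]]]) -> List[List[float]]:
    """Flatten each rank's gradient list into one buffer, reduce once, split back."""
    sizes = [len(g) for g in grads_per_rank[0]]
    flat = [[x for grad in rank for x in grad] for rank in grads_per_rank]
    reduced = allreduce(flat)
    out, i = [], 0
    for s in sizes:
        out.append(reduced[i:i + s])
        i += s
    return out
```

A real test would do the same comparison on the traced graph's outputs, plus assert that the expected number of fused comm nodes appears in the graph.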