Shenggui Li
The previous PR #1405 implemented the sharding spec. This PR implements distributed linear computation using the new sharding spec API.
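For readers unfamiliar with the idea, a column-parallel linear forward can be sketched as below. This is a generic illustration built on plain `torch.distributed` with made-up names (`column_parallel_linear`, `weight_shard`); it is not the sharding spec API that the PR actually uses.

```python
# Generic sketch of a column-parallel linear forward, NOT the ColossalAI
# sharding spec API. Assumes torch.distributed has already been initialized.
import torch
import torch.distributed as dist

def column_parallel_linear(x: torch.Tensor, weight_shard: torch.Tensor) -> torch.Tensor:
    """x: (batch, in_features); weight_shard: (out_features // world_size, in_features)."""
    partial = x @ weight_shard.t()                       # local partial output columns
    world_size = dist.get_world_size()
    gathered = [torch.empty_like(partial) for _ in range(world_size)]
    dist.all_gather(gathered, partial)                   # collect every rank's columns
    return torch.cat(gathered, dim=-1)                   # (batch, out_features)
```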
### 🐛 Describe the bug ZeRO keeps throwing overflow when used together with momentum SGD in the [resnet example](https://github.com/hpcaitech/ColossalAI-Examples/tree/main/image/resnet). The code works fine with all AMP modes. ...
### Proposal In the current model zoo and examples, one model often has two different implementations, e.g. GPT and PipelineGPT. This is because some...
### Describe the feature Currently, Colossal-AI requires at least PyTorch 1.8, as this is the lowest version that provides the collective communication operations we rely on. However, PyTorch 1.8 does not support directly initializing...
### 🐛 Describe the bug When running the unit tests with torch 1.8, the unit tests for the moe module failed as shown below. The error occurs because the API of `torch.nn.Linear`...
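One plausible shape of this kind of incompatibility (an assumption here, since the report is truncated) is the `device`/`dtype` factory keyword arguments, which `torch.nn.Linear` only accepts from PyTorch 1.9 onward. The helper below is hypothetical and assumes a CUDA device is available.

```python
# Hypothetical illustration of a torch-1.8 API gap (an assumption, the report
# above is truncated): the device/dtype constructor kwargs arrived in 1.9.
import torch
import torch.nn as nn

def build_linear(in_features: int, out_features: int) -> nn.Linear:
    try:
        # Newer PyTorch accepts factory kwargs directly in the constructor.
        return nn.Linear(in_features, out_features, device="cuda", dtype=torch.float16)
    except TypeError:
        # Fallback for older versions such as 1.8: construct, then move/cast.
        return nn.Linear(in_features, out_features).to(device="cuda", dtype=torch.float16)
```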
The current LAMB optimizer implementation does not support tensor parallelism, as it needs to compute the norm of the whole matrix. It is not compatible with tensor parallelism because the tensor...
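To see why the whole-matrix norm is the sticking point: with tensor parallelism each rank only holds a shard of the weight, so a locally computed norm is wrong unless the partial sums of squares are combined across the group. Below is a minimal sketch of that combination, assuming an initialized `torch.distributed` process group; it is not the ColossalAI implementation itself.

```python
# Sketch: tensor-parallel-aware Frobenius norm. Each rank sums the squares of
# its own shard, the partial sums are all-reduced, then the square root gives
# the norm of the full, unsharded matrix.
from typing import Optional

import torch
import torch.distributed as dist

def sharded_frobenius_norm(local_shard: torch.Tensor,
                           tp_group: Optional[dist.ProcessGroup] = None) -> torch.Tensor:
    sq_sum = local_shard.pow(2).sum()                         # local sum of squares
    dist.all_reduce(sq_sum, op=dist.ReduceOp.SUM, group=tp_group)  # combine across shards
    return sq_sum.sqrt()                                      # norm of the whole matrix
```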
Fixed the training script so that `len(dataloader)` works correctly. The script is also updated to the new ZeRO API.
### Describe the feature In most examples, there are two files, one training with the engine and one with the trainer. The code in these two files is highly redundant and we should just...
We need to provide an example of doing inference; this should be synced with the documentation as well.
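As a placeholder until the real example lands, a minimal plain-PyTorch inference sketch is shown below; the eventual Colossal-AI example would add its own launch and configuration steps.

```python
# Minimal inference sketch in plain PyTorch (not the Colossal-AI example).
import torch

def run_inference(model: torch.nn.Module, batch: torch.Tensor) -> torch.Tensor:
    model.eval()            # disable dropout, use running batch-norm statistics
    with torch.no_grad():   # no autograd graph needed at inference time
        return model(batch)
```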
### Describe the feature In the current Colossal-AI implementation, we build Colossal-AI in two ways: 1. building ahead of time when running `CUDA_EXT=1 pip install colossalai` 2. building the CUDA kernels when importing...
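For the second mode, a rough sketch of JIT-compiling a kernel at import time with PyTorch's generic extension loader is shown below; the source file names are hypothetical and are not the actual Colossal-AI kernel sources.

```python
# Sketch of import-time (JIT) kernel compilation using PyTorch's generic
# loader. The sources listed here are placeholders, not Colossal-AI's kernels.
from torch.utils.cpp_extension import load

# Compiles the extension the first time it is imported and caches the result.
fused_kernels = load(
    name="fused_kernels",
    sources=["csrc/fused_kernels.cpp", "csrc/fused_kernels_cuda.cu"],
    verbose=True,
)
```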