ErenChan

Results 6 issues of ErenChan

### 🐛 Describe the bug Hi, I use py37 while math.prod is New in version 3.8. I suggest replace the math.prod with other func. such as: ```python from functools import...

bug

### 📚 The doc issue https://github.com/hpcaitech/ColossalAI/blob/main/examples/language/gpt/titans/README.md#29 there are two different cmds and they have the same comments. # run on multiple nodes with slurm I think the first may be...

documentation

### 🐛 Describe the bug I want to use DP=2, TP=2 on 2GPUS. and the info is as bellow: ``` INFO: Distributed environment is initialized, data parallel size: 2, pipeline...

bug

### Describe the feature I often met the problem "RuntimeError: The server socket has failed to listen on any local network address." Usually, I manually run the script several times...

enhancement

### 🐛 Describe the bug Hi, It works normally with PP=2 on 2GPUs. Refer to another question https://github.com/hpcaitech/ColossalAI/issues/2535 But, it seems blocked when I run with PP=4 on 4GPUs, and...

bug

### 🐛 Describe the bug I can run with pp=8,tp=1 w/o zero strategy. myconfig is ```python # from model import GPT2_small_pipeline_hybrid from model import GPT_13b_pp1d import torch from colossalai.nn.optimizer import...

bug