ColossalAI-Examples
ColossalAI-Examples copied to clipboard
Examples of training models with hybrid parallelism using ColossalAI
### Describe the feature Hi all, I'm trying to use a GPT model I trained using ColossalAI with huggingface/transformers for inference but it's not possible to load the model as...
### 📚 The doc issue It seems the pipeline parallel document is out of date(https://www.colossalai.org/docs/features/pipeline_parallel). When I try to use from colossalai.builder import build_pipeline_model, it seems it's none
https://github.com/hpcaitech/ColossalAI-Examples/tree/main/image/detr-debug File" python3.8/site-packages/colossalai/pipeline/pipelinable.py", line 128, in to_layer_list layer_spec = self._layer_spec_dict[id(child)] KeyError: 140687581920992
### 🐛 Describe the bug I try to run a config by using the [train_gpt.py](https://github.com/hpcaitech/ColossalAI-Examples/blob/main/language/gpt/train_gpt.py). I add a model on the [gpt.py](https://github.com/hpcaitech/Titans/blob/main/titans/model/gpt/gpt.py) . ``` def gpt2_test4gpu350M(**kwargs): model_kwargs = dict(hidden_size=1024, depth=24,...
### 🐛 Describe the bug File "/workspace/ColossalAI-Examples/image/detr/models/transformer.py", line 10, in from titans.layer.attention import DeTrAttention ImportError: cannot import name 'DeTrAttention' from 'titans.layer.attention' (/opt/conda/lib/python3.8/site-packages/titans/layer/attention/__init__.py) ### Environment _No response_
### 🐛 Describe the bug when I try to run the code [pipeline_gpt1d.py](https://github.com/hpcaitech/ColossalAI-Examples/blob/main/language/gpt/model/pipeline_gpt1d.p), there is a error, where I can't find the [import model_zoo.gpt.gpt as col_gpt](https://github.com/hpcaitech/ColossalAI-Examples/blob/main/language/gpt/model/pipeline_gpt1d.py#L12). So, what's the model_zoo.?...
### 🐛 Describe the bug Excuse me. When I enter the command "colossalai run --nproc_per_node 4 --host [host1 ip addr],[host2 ip addr] --master_addr [host1 ip addr] train.py", I got this...