ColossalAI-Examples
ColossalAI-Examples copied to clipboard
Examples of training models with hybrid parallelism using ColossalAI
Dear developers, I am trying to run the bert example but I got this error, any hint to fix it? Thanks
the problems of pipeline
Tutorial [1D Tensor Parallelism](https://www.colossalai.org/docs/features/1D_tensor_parallel) mentioned the use of `all_reduce()`, but the example attached doesn't show us how to do it. Quote: > on each processor, then use an all-reduce to...
### Describe the feature An interesting application can attract a lot of attention. https://github.com/openai/CLIP https://github.com/openai/DALL-E DALL·E 2: https://cdn.openai.com/papers/dall-e-2.pdf https://mp.weixin.qq.com/s/FN9KjuHot547fa68usS98Q
### Describe the feature In most examples, there are two files, namely train with engine and trainer. The code is highly redundant in these two files and we should just...
### Describe the feature Hi, I find that we have provided too many huge models as examples, for instance, we reshape cifar-10 to 224*224 and use ViT Huge(at least not...
### 🐛 Describe the bug When I ran gpt2-vanilla with a batch size of 64, there was a CUDA error `RuntimeError: CUDA error: an illegal memory access was encountered`. Then...
Need to provide an example of doing inference, this should be synced in the documentation as well.
### 🐛 Describe the bug When running BERT sequence example with README, an exception happened. `File "train.py", line 240, in main grad_norm = grad_norm.item() AttributeErrorAttributeError: : 'float' object has no...
### Describe the feature Hi there! Is there any plan for Swin Transformer?