ColossalAI-Examples icon indicating copy to clipboard operation
ColossalAI-Examples copied to clipboard

Examples of training models with hybrid parallelism using ColossalAI

Results 37 ColossalAI-Examples issues
Sort by recently updated
recently updated
newest added
trafficstars

Dear developers, I am trying to run the bert example but I got this error, any hint to fix it? Thanks

the problems of pipeline

Tutorial [1D Tensor Parallelism](https://www.colossalai.org/docs/features/1D_tensor_parallel) mentioned the use of `all_reduce()`, but the example attached doesn't show us how to do it. Quote: > on each processor, then use an all-reduce to...

### Describe the feature An interesting application can attract a lot of attention. https://github.com/openai/CLIP https://github.com/openai/DALL-E DALL·E 2: https://cdn.openai.com/papers/dall-e-2.pdf https://mp.weixin.qq.com/s/FN9KjuHot547fa68usS98Q

### Describe the feature In most examples, there are two files, namely train with engine and trainer. The code is highly redundant in these two files and we should just...

### Describe the feature Hi, I find that we have provided too many huge models as examples, for instance, we reshape cifar-10 to 224*224 and use ViT Huge(at least not...

### 🐛 Describe the bug When I ran gpt2-vanilla with a batch size of 64, there was a CUDA error `RuntimeError: CUDA error: an illegal memory access was encountered`. Then...

Need to provide an example of doing inference, this should be synced in the documentation as well.

documentation

### 🐛 Describe the bug When running BERT sequence example with README, an exception happened. `File "train.py", line 240, in main grad_norm = grad_norm.item() AttributeErrorAttributeError: : 'float' object has no...

### Describe the feature Hi there! Is there any plan for Swin Transformer?