ColossalAI icon indicating copy to clipboard operation
ColossalAI copied to clipboard

Making large AI models cheaper, faster and more accessible

Results 1072 ColossalAI issues
Sort by recently updated
recently updated
newest added

### Describe the feature **Describe the feature:** `Training in a large scale env with multiple DP group strategy will load huge data load throughtput. Ex: Training LLAMA2-70B (the checkpoint is...

enhancement

### ๐Ÿ› Describe the bug i'm using the colossalai to train the animate anyone. and the training need to freeze some layers in model. but that would lead to assert...

bug

## ๐Ÿ“Œ Checklist before creating the PR - [ ] I have created an issue for this PR for traceability - [ ] The title follows the standard format: `[doc/gemini/tensor/...]:...

## ๐Ÿ“Œ Checklist before creating the PR - [ ] I have created an issue for this PR for traceability - [ ] The title follows the standard format: `[doc/gemini/tensor/...]:...

## ๐Ÿ“Œ Checklist before creating the PR - [ ] I have created an issue for this PR for traceability - [x] The title follows the standard format: `[doc/gemini/tensor/...]: A...

### Discussed in https://github.com/hpcaitech/ColossalAI/discussions/5393 Originally posted by **zhpacer** February 20, 2024 CUDA: Build cuda_11.8.r11.8/compiler.31833905_0 following: https://github.com/hpcaitech/ColossalAI/tree/main/applications/Colossal-LLaMA-2 Error: utils/flash_attention_patch.py", line 22, in from colossalai.accelerator import get_accelerator ModuleNotFoundError: No module named 'colossalai.accelerator'

## ๐Ÿ“Œ Checklist before creating the PR - [ ] I have created an issue for this PR for traceability - [ ] The title follows the standard format: `[doc/gemini/tensor/...]:...

### ๐Ÿ› Describe the bug return self._op(*args, **kwargs or {}) File "/opt/conda/lib/python3.9/site-packages/torch/_prims/__init__.py", line 292, in _backend_select_impl return meta(*args, **kwargs) File "/opt/conda/lib/python3.9/site-packages/torch/_prims/__init__.py", line 2410, in _iota_meta return torch.empty( File "/opt/conda/lib/python3.9/site-packages/colossalai/lazy/lazy_init.py", line...

bug

### ๐Ÿ› Describe the bug ๆ‚จๅฅฝ๏ผŒๅฝ“ๆˆ‘ๅช่ฐƒ็”จ`from colossalai.nn.layer.moe import Experts, MoeLayer, Top1Router, NormalNoiseGenerator`ๆ—ถๅ€™ๅ‡บ็Žฐไปฅไธ‹้—ฎ้ข˜๏ผš `Traceback (most recent call last): File "/home/songty/working_dir/moe/main.py", line 2, in from colossalai.nn.layer.moe import Experts, MoeLayer, Top1Router, NormalNoiseGenerator File...

bug

## ๐Ÿ“Œ Checklist before creating the PR - [ ] I have created an issue for this PR for traceability - [ ] The title follows the standard format: `[doc/gemini/tensor/...]:...