ColossalAI icon indicating copy to clipboard operation
ColossalAI copied to clipboard

Making large AI models cheaper, faster and more accessible

Results 1072 ColossalAI issues
Sort by recently updated
recently updated
newest added

## 📌 Checklist before creating the PR - [x] I have created an issue for this PR for traceability - [x] The title follows the standard format: `[doc/gemini/tensor/...]: A concise...

## 📌 Checklist before creating the PR - [x] I have created an issue for this PR for traceability - [x] The title follows the standard format: `[doc/gemini/tensor/...]: A concise...

## 📌 Checklist before creating the PR - [ ] I have created an issue for this PR for traceability - [ ] The title follows the standard format: `[doc/gemini/tensor/...]:...

## 📌 Checklist before creating the PR - [ ] I have created an issue for this PR for traceability - [ ] The title follows the standard format: `[doc/gemini/tensor/...]:...

DevOps

## 📌 Checklist before creating the PR - [x] I have created an issue for this PR for traceability - [x] The title follows the standard format: `[doc/gemini/tensor/...]: A concise...

### 🐛 Describe the bug I am currently running the Colossalai/examples/language/openmoe project with the following experimental setup: datasets: load_dataset("yizhongw/self_instruct/data/finetuning/self_instruct_221203", "super_natural_instructions"), model: openmoe-8b A100_num_GPU:8 Epoch:3 BatchSize:4 LR:0.00001 zero_stage:1 precision:bf16 Boost plugin:ep_zero...

bug

…se.py code style ## 📌 Checklist before creating the PR - [ ] I have created an issue for this PR for traceability - [ ] The title follows the...

…distribution ## 📌 Checklist before creating the PR - [x] I have created an issue for this PR for traceability - [x] The title follows the standard format: `[doc/gemini/tensor/...]: A...

### 🐛 Describe the bug # Setup I am currently running the Colossalai/examples/language/openmoe project with the following experimental setup: - datasets: `load_dataset("yizhongw/self_instruct", "super_natural_instructions")`, I also tried `"wikitext-2"` - model: `openmoe-base`...

bug