ColossalAI
ColossalAI copied to clipboard
Making large AI models cheaper, faster and more accessible
## 📌 Checklist before creating the PR - [x] I have created an issue for this PR for traceability - [x] The title follows the standard format: `[doc/gemini/tensor/...]: A concise...
## 📌 Checklist before creating the PR - [x] I have created an issue for this PR for traceability - [x] The title follows the standard format: `[doc/gemini/tensor/...]: A concise...
## 📌 Checklist before creating the PR - [ ] I have created an issue for this PR for traceability - [ ] The title follows the standard format: `[doc/gemini/tensor/...]:...
## 📌 Checklist before creating the PR - [ ] I have created an issue for this PR for traceability - [ ] The title follows the standard format: `[doc/gemini/tensor/...]:...
## 📌 Checklist before creating the PR - [x] I have created an issue for this PR for traceability - [x] The title follows the standard format: `[doc/gemini/tensor/...]: A concise...
### 🐛 Describe the bug I am currently running the Colossalai/examples/language/openmoe project with the following experimental setup: datasets: load_dataset("yizhongw/self_instruct/data/finetuning/self_instruct_221203", "super_natural_instructions"), model: openmoe-8b A100_num_GPU:8 Epoch:3 BatchSize:4 LR:0.00001 zero_stage:1 precision:bf16 Boost plugin:ep_zero...
…se.py code style ## 📌 Checklist before creating the PR - [ ] I have created an issue for this PR for traceability - [ ] The title follows the...
…distribution ## 📌 Checklist before creating the PR - [x] I have created an issue for this PR for traceability - [x] The title follows the standard format: `[doc/gemini/tensor/...]: A...
### 🐛 Describe the bug # Setup I am currently running the Colossalai/examples/language/openmoe project with the following experimental setup: - datasets: `load_dataset("yizhongw/self_instruct", "super_natural_instructions")`, I also tried `"wikitext-2"` - model: `openmoe-base`...