Hongxin Liu issues

Results: 24 issues of Hongxin Liu

[FEATURE]: support DP for chatgpt making experience

### Describe the feature

In our current design, the replay buffer is not distributed. For the consistency and generalization of data sampling during training, each process has a complete copy...

enhancement

chatgpt

[doc] update nvme offload doc

## 📌 Checklist before creating the PR

- [ ] I have created an issue for this PR for traceability
- [ ] The title follows the standard format: `[doc/gemini/tensor/...]:...

[tests] model zoo add torchaudio models

## 📌 Checklist before creating the PR

- [ ] I have created an issue for this PR for traceability
- [x] The title follows the standard format: `[doc/gemini/tensor/...]: A...

Run Build and Test

[lazyinit] add correctness verification

## Overview

The current implementation has not been tested on many models, so we have to add large-scale correctness verification. Wanna track the development progress? Take a look at the proposal: https://github.com/hpcaitech/ColossalAI/discussions/3124 kanban:...

lazyinit

[lazyinit] add verification for distributed cases

## Overview

This work should be started after #3148. Once that is done, we will be able to create a model with lazy initialization and sharding. We have to verify the correctness for...

lazyinit

[lazyinit] combine lazy tensor with dtensor

## Overview

We have implemented a single-process version. We may want lazy tensors to be distributed during or after materialization; this feature may be powered by dtensor. Wanna track the development progress?...

lazyinit

[lazyinit] add correctness verification

## 📌 Checklist before creating the PR

- [x] I have created an issue for this PR for traceability
- [x] The title follows the standard format: `[doc/gemini/tensor/...]: A concise...

Run Build and Test

lazyinit

[booster] add low level zero plugin

## 📌 Checklist before creating the PR

- [ ] I have created an issue for this PR for traceability
- [x] The title follows the standard format: `[doc/gemini/tensor/...]: A...

Run Build and Test

API

[devops] fix chat ci

## 📌 Checklist before creating the PR

- [ ] I have created an issue for this PR for traceability
- [x] The title follows the standard format: `[doc/gemini/tensor/...]: A...

bug

DevOps

chatgpt

[devops] fix ci error due to version conflict

## 📌 Checklist before creating the PR

- [ ] I have created an issue for this PR for traceability
- [ ] The title follows the standard format: `[doc/gemini/tensor/...]:...

DevOps