Jiarui Fang(方佳瑞)

Results 220 comments of Jiarui Fang(方佳瑞)

Yes, it is on the roadmap of the v0.2.0 version. In that version, we will have a new interface for ZeRO.

The DeepSpeed benchmark script https://github.com/feifeibear/DeepSpeedZeRO3Benchmark The PatrickStar https://github.com/Tencent/PatrickStar/blob/master/examples/run_transformers.sh The benchmarking is very easy. ``` export SUFFIX="colossal_compare" env GPU_NUM=8 MODEL_TYPE="GPT" MODEL_NAME=GPT3_10B BS=2 CPU_EBD=0 AMM=1 MSC=1 CACHE=1 SP=0 CS=288 HYB=1 TILING=0 ACT_OFFLOAD=0...

I have uploaded the logs of DeepSpeed and PatirckStar to Baidu WangPan... Note that for DeepSpeed, the SamplesPerSec is not equal to 'Throughput'. You have to calculate it by batch/elapse....

@1SAA did you fix the issue?

Could please get an example to illustrate the not supported cases! @1SAA

You can add a setter in class Linear. Make sure your setting is valid in terms if the tensor shape. colossalai/nn/layer/colossalai_layer/linear.py ``` @property def bias(self, value): self.layer.bias = value ```

The bias shape is different. Users do not know how did you split the bias?

> > > > Should I create 3 new issues for these and start working on them? Sure, you can post the issues separately and descript your solution to them....

@xbasly Did you solve you problem? Can you guys provide a reproducible script for us?

Hello, @SMesForoush , Thanks for your suggestion. I am looking for your PR to the main branch.