Jiarui Fang（方佳瑞） comments

Results 220 comments of


                                            Jiarui Fang（方佳瑞）

[BUG]: ZeRO causes runtime error when use GRU and pack sequence

Yes, it is on the roadmap of the v0.2.0 version. In that version, we will have a new interface for ZeRO.

The performance of model parallelism (MP) is not good

The DeepSpeed benchmark script https://github.com/feifeibear/DeepSpeedZeRO3Benchmark The PatrickStar https://github.com/Tencent/PatrickStar/blob/master/examples/run_transformers.sh The benchmarking is very easy. ``` export SUFFIX="colossal_compare" env GPU_NUM=8 MODEL_TYPE="GPT" MODEL_NAME=GPT3_10B BS=2 CPU_EBD=0 AMM=1 MSC=1 CACHE=1 SP=0 CS=288 HYB=1 TILING=0 ACT_OFFLOAD=0...

The performance of model parallelism (MP) is not good

I have uploaded the logs of DeepSpeed and PatirckStar to Baidu WangPan... Note that for DeepSpeed, the SamplesPerSec is not equal to 'Throughput'. You have to calculate it by batch/elapse....

ZeRO dose not initialize weight correctly

@1SAA did you fix the issue?

ZeRO dose not initialize weight correctly

Could please get an example to illustrate the not supported cases! @1SAA

[BUG]: Different behaviors when setting param for colossalai.nn.Linear and torch.nn.Linear

You can add a setter in class Linear. Make sure your setting is valid in terms if the tensor shape. colossalai/nn/layer/colossalai_layer/linear.py ``` @property def bias(self, value): self.layer.bias = value ```

[BUG]: Different behaviors when setting param for colossalai.nn.Linear and torch.nn.Linear

The bias shape is different. Users do not know how did you split the bias?

List of feature ideas

> > > > Should I create 3 new issues for these and start working on them? Sure, you can post the issues separately and descript your solution to them....

[BUG]: found inf during ShardedOptimV2 step

@xbasly Did you solve you problem? Can you guys provide a reproducible script for us?

Adding Loki-Promtail configurations for log monitoring

Hello, @SMesForoush , Thanks for your suggestion. I am looking for your PR to the main branch.