binmakeswell

Results 359 comments of binmakeswell

Hi @bobo0810 We have replied it at #2937 Thanks.

This issue was closed due to inactivity. Thanks.

Hi @lupingllp and @grafail , just as the bug hint, you need to import the CodeGenAttention class rather than only replace the model. We are also working at 'lazy init',...

Hi @pilipala818 Could you please provide more details about your issue? It's hard for us to follow/reproduce/fix it now. Thanks.

We have updated a lot. Please check the latest code. This issue was closed due to inactivity. Thanks.

> titan pipeline启动太慢了,单机8卡 3090卡,pp=4, tp=2,启动训练需要等待10多分钟,30B的模型A100 80G 启动训练要卡30分钟以上, Megatron就启动很快。。。 Hi @joan126 Can you open a new issue and provide details? So we can reproduce your question. Thanks. https://github.com/hpcaitech/ColossalAI/issues/new/choose

> I got this error too when useing pytorch 1.13.0, colossalai 0.2.8 with transformers 4.28.1 and 4.24.0 How about PyTorch 1.12.1?

Glad to hear it was resolved. Thanks.