VisualGLM-6B

Error: RuntimeError: Error building extension 'fused_adam'

Open zbxx-423 opened this issue 1 year ago • 10 comments

[Screenshot: Snipaste_2023-10-19_15-41-33] Is this a problem with the CUDA version or the GPU's compute capability?

zbxx-423 avatar Oct 19 '23 07:10 zbxx-423

This is not a compute capability problem. nvcc simply isn't installed, or it hasn't been added to your environment variables.
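A quick way to check this is sketched below. It assumes a POSIX shell; `check_nvcc` is a hypothetical helper name, and reading `CUDA_HOME` reflects the fact that PyTorch's extension builder consults that variable when locating the CUDA toolkit.

```shell
# Report whether nvcc is visible on PATH and what CUDA_HOME is set to.
check_nvcc() {
    if command -v nvcc >/dev/null 2>&1; then
        echo "nvcc found: $(command -v nvcc)"
    else
        echo "nvcc not found on PATH"
    fi
    echo "CUDA_HOME=${CUDA_HOME:-<unset>}"
}

check_nvcc
```

If nvcc is installed but not found, adding the toolkit's `bin` directory to `PATH` (often `/usr/local/cuda/bin`, though the location depends on how CUDA was installed) is the usual fix.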

1049451037 avatar Oct 19 '23 07:10 1049451037

This is not a compute capability problem. nvcc simply isn't installed, or it hasn't been added to your environment variables.

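After I put nvcc in the directory it expected, that part was resolved, but I still get RuntimeError: Error building extension 'fused_adam'. I'm using two 3090s, and before the error both GPUs showed 100% utilization.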

zbxx-423 avatar Oct 23 '23 07:10 zbxx-423

See here: https://github.com/THUDM/VisualGLM-6B/issues/125#issuecomment-1630407747

1049451037 avatar Oct 23 '23 07:10 1049451037

Changing my CUDA version to 11.7 solved this for me. It's mainly a DeepSpeed issue.
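One plausible reason a CUDA version change helps (an assumption, not something confirmed in this thread) is a mismatch between the toolkit that nvcc belongs to and the CUDA version PyTorch was built with. A minimal sketch for comparing the two, where `nvcc_release` is a hypothetical helper name:

```shell
# Extract "11.7" from a line such as
# "Cuda compilation tools, release 11.7, V11.7.99" read on stdin.
nvcc_release() {
    sed -n 's/.*release \([0-9][0-9]*\.[0-9][0-9]*\).*/\1/p'
}

# CUDA version of the toolkit that would compile the extension.
if command -v nvcc >/dev/null 2>&1; then
    echo "nvcc CUDA version: $(nvcc --version | nvcc_release)"
fi

# torch.version.cuda is the CUDA version the installed PyTorch was built with.
if command -v python3 >/dev/null 2>&1; then
    python3 -c "import torch; print('PyTorch built with CUDA:', torch.version.cuda)" 2>/dev/null \
        || echo "PyTorch is not importable in this environment"
fi
```

If the two versions differ, aligning them (as was done here by moving to CUDA 11.7) is worth trying before anything else.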

LijunRio avatar Nov 03 '23 13:11 LijunRio

Changing my CUDA version to 11.7 solved this for me. It's mainly a DeepSpeed issue.

Is it a DeepSpeed version problem? Which version did you use?

chenchen333-dev avatar Mar 14 '24 10:03 chenchen333-dev

How do I fix this?

a1stupid avatar Mar 28 '24 00:03 a1stupid

Same issue, how to fix it?

zzyzeyuan avatar Apr 03 '24 07:04 zzyzeyuan

@zzyzeyuan Have you fixed it?

Gae-Zhang avatar Apr 10 '24 09:04 Gae-Zhang

@zzyzeyuan Have you fixed it?

Yes, see this page.

The error may be caused by a faulty DeepSpeed installation. You just need to clone the DeepSpeed repo and install it manually, like this:

git clone https://github.com/microsoft/DeepSpeed.git
cd DeepSpeed
DS_BUILD_FUSED_ADAM=1 pip3 install .
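
To confirm the build took effect, DeepSpeed's `ds_report` command lists each op along with whether it is installed and compatible. The sketch below filters that report for the fused_adam row; `report_fused_adam` is a hypothetical helper name, and the exact layout of the report may differ between DeepSpeed versions.

```shell
# Show the fused_adam row from DeepSpeed's environment report, if available.
report_fused_adam() {
    if command -v ds_report >/dev/null 2>&1; then
        ds_report 2>/dev/null | grep -i "fused_adam" \
            || echo "fused_adam not listed in ds_report output"
    else
        echo "ds_report not found; is DeepSpeed installed in this environment?"
    fi
}

report_fused_adam
```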

zzyzeyuan avatar Apr 10 '24 10:04 zzyzeyuan

Thank you so much!! Good luck with your research!

Gae-Zhang avatar Apr 10 '24 12:04 Gae-Zhang