ChatGLM-6B icon indicating copy to clipboard operation
ChatGLM-6B copied to clipboard

Deepspeed finetune需要多大的显存?

Open twosnowman opened this issue 2 years ago • 3 comments

Is there an existing issue for this?

  • [X] I have searched the existing issues

Current Behavior

3060 12G显卡deepspeed训练时报显存不够

Expected Behavior

No response

Steps To Reproduce

deepspeed训练需要多大显存?

Environment

OS: Ubuntu 22.04
Python: 3.10
Transformers: 4.26.1
PyTorch: 1.12
CUDA Support: True

Anything else?

No response

twosnowman avatar May 06 '23 03:05 twosnowman

24GB batch size为1 都不够

kjgfjlkj avatar May 06 '23 04:05 kjgfjlkj

测试 deepspeed zero2 62G /单张

gawei1995 avatar May 06 '23 07:05 gawei1995

看你训练样本,调整一个参数,再搞个量化也许能成,不过12G确实有点少

TE-Raven avatar May 12 '23 09:05 TE-Raven

请问 显卡吞吐量如何 大概多少tokens/GPU/s

newtonysls avatar May 19 '23 09:05 newtonysls

看你显卡型号

gawei1995 avatar May 19 '23 10:05 gawei1995

Duplicate of #556

zhangch9 avatar Aug 16 '23 12:08 zhangch9