ColossalAI
ColossalAI copied to clipboard
有什么训练策略是可以直接在单机单显卡运行的吗?[DOC]:
📚 The doc issue
有相关的文档说明吗
Bot detected the issue body's language is not English, translate it automatically. 👯👭🏻🧑🤝🧑👫🧑🏿🤝🧑🏻👩🏾🤝👨🏿👬🏿
Title: Is there any training strategy that can be run directly on a single computer with a single graphics card? [DOC]:
Try this one out: https://github.com/hpcaitech/ColossalAI/tree/main/examples/language/gpt/gemini
During handling of the above exception, another exception occurred:
Traceback (most recent call last):
File "./train_gpt_demo.py", line 353, in
这个需要什么配置才能跑起来呢 @JThh
Bot detected the issue body's language is not English, translate it automatically. 👯👭🏻🧑🤝🧑👫🧑🏿🤝🧑🏻👩🏾🤝👨🏿👬🏿
What configuration does this need to run? @JThh
Can I know what's your current configuration? 你目前的配置是怎样的呢?
Bot detected the issue body's language is not English, translate it automatically. 👯👭🏻🧑🤝🧑👫🧑🏿🤝🧑🏻👩🏾🤝👨🏿👬🏿
Can I know what's your current configuration? What is your current configuration?
+---------------------------------------------------------------------------------------+ | NVIDIA-SMI 530.30.02 Driver Version: 531.18 CUDA Version: 12.1 | |-----------------------------------------+----------------------+----------------------+ | GPU Name Persistence-M| Bus-Id Disp.A | Volatile Uncorr. ECC | | Fan Temp Perf Pwr:Usage/Cap| Memory-Usage | GPU-Util Compute M. | | | | MIG M. | |=========================================+======================+======================| | 0 NVIDIA GeForce GTX 1070 On | 00000000:01:00.0 Off | N/A | | 45% 47C P8 8W / 150W| 0MiB / 8192MiB | 0% Default | | | | N/A | +-----------------------------------------+----------------------+----------------------+ | 1 NVIDIA GeForce RTX 3080 On | 00000000:03:00.0 On | N/A | | 0% 46C P8 21W / 340W| 1126MiB / 10240MiB | 0% Default | | | | N/A | +-----------------------------------------+----------------------+----------------------+
+---------------------------------------------------------------------------------------+ | Processes: | | GPU GI CI PID Type Process name GPU Memory | | ID ID Usage | |=======================================================================================| | No running processes found | +---------------------------------------------------------------------------------------+
@JThh 这是我目前的配置 2张显卡 ,3080 + 1070
Bot detected the issue body's language is not English, translate it automatically. 👯👭🏻🧑🤝🧑👫🧑🏿🤝🧑🏻👩🏾🤝👨🏿👬🏿
@JThh this is my current configuration 2 graphics cards , 3080 + 1070
我把batchsize都调的很小,还是说内存不够 export PLACEMENT=${PLACEMENT:-"cput"} ----> 这个参数是不是应该设置为cuda
Bot detected the issue body's language is not English, translate it automatically. 👯👭🏻🧑🤝🧑👫🧑🏿🤝🧑🏻👩🏾🤝👨🏿👬🏿
I adjusted the batchsize very small, or the memory is not enough export PLACEMENT=${PLACEMENT:-"cput"} ----> Should this parameter be set to cuda
我把batchsize都调的很小,还是说内存不够 export PLACEMENT=${PLACEMENT:-"cput"} ----> 这个参数是不是应该设置为cuda
你跑多大的gpt2,报错的
Bot detected the issue body's language is not English, translate it automatically. 👯👭🏻🧑🤝🧑👫🧑🏿🤝🧑🏻👩🏾🤝👨🏿👬🏿
I adjusted the batchsize very small, or the memory is not enough export PLACEMENT=${PLACEMENT:-"cput"} ----> Should this parameter be set to cuda
How much gpt2 did you run, and it reported an error
Take a look at this.
We have updated a lot. Please check the latest code. This issue was closed due to inactivity. Thanks.