lcaruewu
lcaruewu
> Can you tell me the machine configuration which you successfully ran the train.py ? I meet the same problem, but i have no idea
> 
> > > > > > > > > Can you tell me the machine configuration which you successfully ran the train.py ? I meet the same problem, but i...
but i also to monti > > > > > > > > > > > > > Can you tell me the machine configuration which you successfully ran the...
> Have you ever tried this: `git clone https://github.com/huggingface/transformers.git` `cd transformers` `git checkout 0041be5` `pip install .` > > maybe it works. thank for you reply, i have follow you...
me too, my config: V100 16G * 4, CPU RAM 128G , how to solve this problem?
i meet same problem
i solve this problem by add config, now 4 * V100 32G RAM 328G,I can run 13B
用了deepspeed不? > Minxiangliu
> > 用了deepspeed不? > > > Minxiangliu > > I am not using DeepSpeed. Here are the commands I am running. > > ``` > TORCH_CPP_LOG_LEVEL=INFO NCCL_DEBUG=INFO LOGLEVEL=INFO \ >...