badarrrr
badarrrr
Traceback (most recent call last): File "/gemini/code/train/main.py", line 130, in train() File "/gemini/code/train/main.py", line 126, in train trainer.train(resume_from_checkpoint=False) File "/root/miniconda3/lib/python3.11/site-packages/transformers/trainer.py", line 1948, in train return inner_training_loop( ^^^^^^^^^^^^^^^^^^^^ File "/root/miniconda3/lib/python3.11/site-packages/transformers/trainer.py", line...
Thanks for your comment.I'll try it later
> > 在deepspeed config里将stage3_prefetch_bucket_size设为15099494试试呢? > > 可以,但是会报新错误: RuntimeError: shape '[32768, -1, 1, 32, 2]' is invalid for input of size 524288 我遇到了跟你一模一样的错误: Traceback of TorchScript (most recent call last): File...
> > # 附上运行 ' ./scripts/glm4_longwriter.sh' 时的报错信息: > > KeyError: '' Using unk_token, but it is not set yet. Traceback (most recent call last): File "/root/AI4E/ljc/LongWriter/train/main.py", line 139, in train()...