ColossalAI
ColossalAI copied to clipboard
torch.cuda.OutOfMemoryError: CUDA out of memory
🐛 Describe the bug
A10080G8卡的机器,batch_size=1,7B的llama-2模型,train_sft.py和train_reward_model.py都跑不起来
Environment
You are using a model of type mistral to instantiate a model of type llama. This is not supported for all configurations of models and can yield errors