ColossalAI
ColossalAI copied to clipboard
[BUG]: train_gpt_demo.py cannot load parameters from model file
🐛 Describe the bug
I want to load model parameters from a pretrained model after use tp, but use train_gpt_demo.py after zero_model_wrapper also made tensor shape mismatch error
Environment
No response
Hi @zhanghaoie maybe you can refer to this example https://github.com/hpcaitech/ColossalAI/tree/main/applications/ChatGPT#how-to-saveload-checkpoint
We have updated a lot. This issue was closed due to inactivity. Thanks.