ColossalAI
ColossalAI copied to clipboard
[BUG]: memory not decrease
🐛 Describe the bug
I use the train_pokemon.yaml file to train the model on my device with 3090ti gpu, then I got an OOM error. so I set the batch size to 1, but the gpu used is 18G, there are must some thing wrong....
Environment
No response
Hi @songjin321 Thank you for your feedback. We will try to reproduce your issue and fix it soon.
you can try train_colossalai.yaml
the train_pokemon.yaml is not completed