ColossalAI
ColossalAI copied to clipboard
why GPTActor is not to load pretained parameters in chatgpt examples
π Describe the bug
with strategy.model_init_context(): if args.model == 'gpt2': actor = GPTActor().cuda() critic = GPTCritic().cuda()
Environment
No response
Bot detected the issue body's language is not English, translate it automatically. π―ππ»π§βπ€βπ§π«π§πΏβπ€βπ§π»π©πΎβπ€βπ¨πΏπ¬πΏ
Title: why GPTActor is not to load pretrained parameters in chatgpt examples
As it just shows training process, we just use randomly initialized model for simplicity. You can use a pretrained model easily by GPTActor(pretrained='gpt2')
.
https://github.com/hpcaitech/ColossalAI/tree/main/applications/Chat We have updated a lot. This issue was closed due to inactivity. Thanks.