Megatron-LM icon indicating copy to clipboard operation
Megatron-LM copied to clipboard

[QUESTION] When will model have `_extra_state`?

Open 1049451037 opened this issue 7 months ago • 1 comments

After updating to the main branch of Megatron-LM recently, I met this error when loading model:

Unexpected key(s) in state_dict: "decoder.layers.0.self_attention.core_attention._extra_state"

The checkpoint is transformed by the tools/checkpoint/convert.py, and loaded by pretrain_gpt.py.

1049451037 avatar Jul 03 '24 06:07 1049451037