Megatron-LM
Fix wrong key for output_layer_init_method
Since we aim to use mcore for training, we have a function that translates the Megatron-LM args into their mcore equivalents.
However, the key used for output_layer_init_method is incorrect. [1]
This PR fixes it.
[1] https://github.com/NVIDIA/Megatron-LM/blob/main/megatron/core/transformer/transformer_config.py#L95
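
As an illustration of this class of bug, here is a minimal sketch of how a misnamed key can be caught when keyword arguments are assembled for a dataclass-based config. Everything in it is an assumption made for illustration: `ToyTransformerConfig`, `config_from_kwargs`, `toy_init`, and the misspelled key `out_layer_init` are hypothetical names, not the actual Megatron-LM code or the actual wrong key. Only the field name `output_layer_init_method` comes from [1].

```python
from dataclasses import dataclass, fields
from typing import Callable, Optional


@dataclass
class ToyTransformerConfig:
    # Hypothetical stand-in, NOT the real mcore TransformerConfig.
    init_method: Optional[Callable] = None
    output_layer_init_method: Optional[Callable] = None


def config_from_kwargs(kw_args: dict) -> ToyTransformerConfig:
    """Build the toy config, rejecting keys that are not config fields."""
    valid = {f.name for f in fields(ToyTransformerConfig)}
    unknown = sorted(set(kw_args) - valid)
    if unknown:
        raise KeyError(f"unknown config keys: {unknown}")
    return ToyTransformerConfig(**kw_args)


def toy_init(tensor):
    # Placeholder initializer; a real one would fill the tensor in place.
    return tensor


# Correct key: the initializer lands on the intended field.
cfg = config_from_kwargs({
    "init_method": toy_init,
    "output_layer_init_method": toy_init,
})

# A misnamed key (hypothetical example) is reported instead of silently passing.
try:
    config_from_kwargs({"out_layer_init": toy_init})
except KeyError as err:
    error_message = str(err)
```

Checking the keys against `dataclasses.fields` up front gives an error message that names the offending key, which can make a mismatch like the one described here easier to spot.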
Marking as stale. No activity in 60 days.