Megatron-LM icon indicating copy to clipboard operation
Megatron-LM copied to clipboard

[QUESTION] Does Megatron-Core supports LLAMA models?

Open noob-ctrl opened this issue 9 months ago • 5 comments

Does Megatron-Core supports LLAMA models?

noob-ctrl avatar May 03 '24 08:05 noob-ctrl

yes

ethanhe42 avatar May 03 '24 23:05 ethanhe42

@ethanhe42 When transformer-impl is local, it reports the following error: AssertionError: (RMSNorm) is not supported in FusedLayerNorm when instantiating FusedLayerNorm when instantiating TransformerLayer When transformer-impl is transformer_engine, the following code does not seem to define RMSNorm? image So do I need to make any changes when I want to use llama?

noob-ctrl avatar May 04 '24 01:05 noob-ctrl

You need to use mcore models. local is deprecating

ethanhe42 avatar May 04 '24 03:05 ethanhe42

@ethanhe42 When transformer-impl is set to transformer_engine, the following code does not seem to define RMSNorm? image

noob-ctrl avatar May 04 '24 16:05 noob-ctrl

It's handled by TEnorm

ethanhe42 avatar May 05 '24 04:05 ethanhe42

Marking as stale. No activity in 60 days.

github-actions[bot] avatar Jul 04 '24 18:07 github-actions[bot]