Params Raman

2 comments by Params Raman

@xyltt @DY-TL I encountered this error too; the trace is below. Did either of you find the cause and a fix? I'd appreciate any pointers. ` File "/workspace/prraman/megatron/optimizer.py", line...

Following up with some debugging on this: I found that the flag `params_have_main_grad`, which causes the above code to look for the `main_grad` attribute, is set here (https://github.com/NVIDIA/Megatron-LM/blob/main/megatron/optimizer/__init__.py#L71,L73) in the `__init__.py`...
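
To make the flag's effect concrete, here is a minimal sketch of the behaviour described above. It is not Megatron-LM's actual code: `select_grad` and `params_missing_main_grad` are hypothetical helper names, and plain `SimpleNamespace` objects stand in for model parameters so the snippet runs without PyTorch. The assumption is only that, with `params_have_main_grad` set, the optimizer reads `param.main_grad` instead of `param.grad`.

```python
from types import SimpleNamespace


def select_grad(param, params_have_main_grad):
    """Return the gradient an optimizer would read for `param` (hypothetical helper)."""
    if params_have_main_grad:
        # Fails with AttributeError if nothing ever attached `main_grad`.
        return param.main_grad
    return param.grad


def params_missing_main_grad(named_params):
    """List the names of parameters that have no `main_grad` attribute."""
    return [name for name, p in named_params if not hasattr(p, "main_grad")]


# Stand-ins for parameters: one with `main_grad` attached, one without.
wrapped = SimpleNamespace(grad=None, main_grad=[0.0])
plain = SimpleNamespace(grad=[1.0])
named = [("wrapped.weight", wrapped), ("plain.weight", plain)]

missing = params_missing_main_grad(named)
```

With a real model, the same check could presumably be run over `model.named_parameters()` just before the optimizer step to see which parameters lack `main_grad` while the flag is enabled.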