Oleg Filatov

Results 1 issues of Oleg Filatov

Hi! I've been looking into the integration of `muP` into the `Megatron-LM` setup and I was wondering about the `_rescale_parameters()` method of `MuReadout` in case of shared (tied) input/output embeddings....