Oleg Filatov
Results
1
issues of
Oleg Filatov
Hi! I've been looking into the integration of `muP` into the `Megatron-LM` setup and I was wondering about the `_rescale_parameters()` method of `MuReadout` in case of shared (tied) input/output embeddings....