DeepSpeed
DeepSpeed copied to clipboard
Recover shared parameters
To address this issue:
Shared parameters that hold reference are missing when extracts fp32 weights from checkpoint. This PR recovers such shared parameters by linking them with their partners.