Pangu Improvements
Modulus Pull Request
Description
This PR adds the following features/changes to the Pangu model and training script:
- Configurable number of constant, surface, and atmosphere variables in the model.
- Configurable number of upsampled and downsampled transformer blocks.
- Gradient checkpointing support in the Pangu processor (encoder/decoder) layers.
- Improved training script with improved static capture support, multistep rollout, validation function, and weighted loss function.
Checklist
- [ ] I am familiar with the Contributing Guidelines.
- [x] New or existing tests cover these changes.
- [ ] The documentation is up to date with these changes.
- [ ] The CHANGELOG.md is up to date with these changes.
- [ ] An issue is linked to this pull request.
/blossom-ci
/blossom-ci
/blossom-ci
/blossom-ci
/blossom-ci
Depends on #660
I added some minor suggestions, overall this looks great though. I found the LambdaLR scheme with the custom hydra resolver a bit convoluted, maybe a ConstantLR would be simpler and more readable to achieve the same effect. However it is nice to have the example in there if someone wants to do more custom scheduling.