ufs-weather-model
bug fixes: kchunk3d ignored, hailwat uninitialized in dycore, tile_num wrong for nests
Commit Queue Requirements:
- [x] Fill out all sections of this template.
- [x] All sub component pull requests have been reviewed by their code managers.
- [x] Run the full Intel+GNU RT suite (compared to current baselines) on either Hera/Derecho/Hercules
- [x] Commit 'test_changes.list' from previous step
Description:
Fixes these bugs:
- Fix from @DusanJovic-NOAA wherein the kchunk3d setting in model_configure was ignored. This caused an abort due to a negative index in an MPI call on some platforms. This may have been due to a 32-bit integer wraparound, but we cannot confirm that.
- A hailwat variable was uninitialized in the FV3 dynamical core. Now it is set to the hailwat tracer index.
- The tile_num sent to CCPP in FV3 was wrong for the nest because it was the index of the tile in the mosaic (index 1) instead of the "global tile number" (index 7). This is corrected by having the dynamical core pass the "global tile number" up to the model.
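For readers unfamiliar with the suspected failure mode in the kchunk3d item, here is a minimal Python sketch of how a 32-bit signed wraparound can turn a large positive offset into a negative one. It is an illustration only: the sizes and the helper `to_int32` are hypothetical, none of this is taken from the FV3 code, and the PR itself does not confirm that wraparound was the cause.

```python
def to_int32(value: int) -> int:
    """Reinterpret an integer as a signed 32-bit two's-complement value."""
    return (value + 2**31) % 2**32 - 2**31


# Hypothetical sizes, chosen only so the product exceeds 2**31 - 1.
elements_per_level = 50_000_000
levels = 64

true_offset = elements_per_level * levels  # exact value in Python's big ints
wrapped_offset = to_int32(true_offset)     # what a 32-bit integer would hold

print("true offset:   ", true_offset)
print("wrapped offset:", wrapped_offset)
print("negative after wrap:", wrapped_offset < 0)
```

Any offset or count computed this way in 32-bit arithmetic and then handed to an MPI call would show up as a negative index, which is consistent with the symptom described, though not proof of it.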
No answers should change.
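To make the tile_num distinction above concrete, the following Python sketch contrasts a tile's index within its own mosaic with a tile number counted across all grids. The function name, the `tiles_per_grid` list, and the counting scheme are assumptions made for illustration (a six-tile global cubed sphere followed by a single-tile nest); this is not how the dynamical core computes or passes the value.

```python
def global_tile_number(tiles_per_grid, grid_index, tile_in_mosaic):
    """Return a 1-based tile number counted across all grids.

    tiles_per_grid: number of tiles in each grid, in order (hypothetical input)
    grid_index:     0-based index of the grid that owns the tile
    tile_in_mosaic: 1-based index of the tile within that grid's own mosaic
    """
    if not 1 <= tile_in_mosaic <= tiles_per_grid[grid_index]:
        raise ValueError("tile_in_mosaic is out of range for this grid")
    # Tiles of all preceding grids come first, then the local index.
    return sum(tiles_per_grid[:grid_index]) + tile_in_mosaic


# Assumed layout: six global cubed-sphere tiles, then one nest tile.
tiles_per_grid = [6, 1]

nest_tile_in_mosaic = 1
nest_global_tile = global_tile_number(tiles_per_grid, 1, nest_tile_in_mosaic)

print("index in nest mosaic:", nest_tile_in_mosaic)
print("global tile number:  ", nest_global_tile)
```

Under these assumptions the nest's only tile has mosaic index 1 but global tile number 7, which matches the two values named in the description.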
Commit Message:
* UFSWM -
* FV3 - correct handling of kchunk3d and use the right tile number in CCPP
* atmos_cubed_sphere - initialize the hailwat variable and pass global_tile up to model
Priority:
- Critical
Git Tracking
UFSWM:
Issues:
- UFSWM: fixes https://github.com/ufs-community/ufs-weather-model/issues/2209
- UFSWM: fixes https://github.com/ufs-community/ufs-weather-model/issues/2227
- FV3: https://github.com/NOAA-EMC/fv3atm/issues/797
- GFDL_atmos_cubed_sphere:
- https://github.com/NOAA-GFDL/GFDL_atmos_cubed_sphere/issues/328
- https://github.com/NOAA-GFDL/GFDL_atmos_cubed_sphere/issues/329
Note: Although #2227 is an issue in this repository, the bug is in FV3.
Sub component Pull Requests:
- FV3: https://github.com/NOAA-EMC/fv3atm/pull/806
- atmos_cubed_sphere: https://github.com/NOAA-GFDL/GFDL_atmos_cubed_sphere/pull/331
UFSWM Blocking Dependencies:
Changes
Regression Test Changes (Please commit test_changes.list):
- No Baseline Changes.
Input data Changes:
- None.
Library Changes/Upgrades:
- No Updates
Testing Log:
- RDHPCS
- [ ] Hera
- [ ] Orion
- [ ] Hercules
- [ ] Jet
- [ ] Gaea
- [ ] Derecho
- WCOSS2
- [ ] Dogwood/Cactus
- [ ] Acorn
- [ ] CI
- [ ] opnReqTest (complete task if unnecessary)
@SamuelTrahanNOAA EPIC wants to go with this PR next. Could you run the full suite on Hera and commit the test_changes.list please?
I am rerunning regression tests now. 259 of 299 tests have completed and none have failed. I disabled job resubmission, so this means the tests are passing on the first try.
Could someone please request reviews from these individuals?
@DusanJovic-NOAA @zhanglikate @kayeekayee @spanNOAA @ChristianBoyer-NOAA
They have been involved in testing the fix for the critical kchunk3d bug
Only Dusan seems to be allowed as a requested reviewer, but I believe the others can still give a review.
Regression tests passed. No baseline changes.
EDIT: Regression tests passed on Hera. I didn't run them anywhere else.
I've merged develop. Those changes were all CICE, so they should not affect this PR's changes nor the bug people are encountering. Hence, I am not rerunning regression tests unless someone asks me to do that. Code managers will run regression tests in the ordinary testing process.
So everyone knows: the full RT suite is not meant to be rerun unless code changes related to the bug or feature in the PR are made. Merging develop is not in that category, so the suite does not need to be rerun.
I've retested my nested global case inside the global-workflow and it has passed the failure point. (I had already tested it outside the workflow.) This triad of fixes still works for me. I look forward to seeing them in the develop branch.
We are going to start working on this PR today. @FernandoAndrade-NOAA @BrianCurtis-NOAA FYI
Jet hasn't finished. Did something go wrong over there?
I've had lots of little technical issues while running on Jet since the Rocky upgrade.
It was just a little slow yesterday. It looks like it passed; I'll push it up shortly.
We can proceed with the merging process. I'll follow up on the cubed-sphere PR.
The cubed-sphere PR has been merged. I updated the FV3 PR to point to the authoritative .gitmodules and cubed sphere.
You can proceed to merging the FV3 PR.
@SamuelTrahanNOAA FV3 merged. Hash: https://github.com/NOAA-EMC/fv3atm/commit/37e7d4859db4eb75472091abc650831060037715
I have reverted .gitmodules and pointed FV3 to the head of the authoritative develop branch.
This PR is ready for final review and merge.