Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Rename CCPP branches from master to main, several small changes in ccpp-physics #572

Merged

Conversation

climbfuji
Copy link
Collaborator

@climbfuji climbfuji commented May 12, 2021

PR Checklist

  • Ths PR is up-to-date with the top of all sub-component repositories except for those sub-components which are the subject of this PR. Please consult the ufs-weather-model wiki if you are unsure how to do this.

  • This PR has been tested using a branch which is up-to-date with the top of all sub-component repositories except for those sub-components which are the subject of this PR

  • An Issue describing the work contained in this PR has been created either in the subcomponent(s) or in the ufs-weather-model. The Issue should be created in the repository that is most relevant to the changes in contained in the PR. The Issue and the dependent sub-component PR
    are specified below.

  • If new or updated input data is required by this PR, it is clearly stated in the text of the PR.

Description

This PR and dependencies below change the name of the ccpp-framework and ccpp-physics branches from "main" to "master" and, in addition, include a few small updates and bugfixes in ccpp-framework and ccpp-physics.

Also included: bug fixes for wcoss_dell_p3 modulefiles to correctly load cmake (from @DusanJovic-NOAA)

Update 2021/05/14

This PR now contains the following two PRs:

Update 2021/05/17

The changes in file GFS_suite_interstitial.F90 (use assumed-size array declarations) in ccpp-physics PR https://github.com/NCAR/ccpp-physics/pull/652/files change the regression test baseline for the following three tests with Intel in PROD mode:

cpld_bmarkfrac_v16
cpld_bmarkfrac_wave
cpld_bmarkfrac_wave_v16

Note that only one file is different, the model run itself completes w/o changes to the final checksums (this is the same for all three tests):

Dom.Heinzeller@hfe06:/scratch1/BMC/gmtb/Dom.Heinzeller/ufs-weather-model/ufs-weather-model-emc-develop-20210512-various-ccpp-changes/intel/tests [intel|gsd-hpcs]> cat log_hera.intel/rt_025_cpld_bmarkfrac_wave.log
baseline dir = /scratch1/NCEPDEV/nems/emc.nemspara/RT/NEMSfv3gfs/develop-20210513/INTEL/cpld_bmarkfrac_wave
working dir  = /scratch1/NCEPDEV/stmp2/Dom.Heinzeller/FV3_RT/rt_265707/cpld_bmarkfrac_wave
Checking test 025 cpld_bmarkfrac_wave results ....
 Comparing phyf024.tile1.nc .........OK
 Comparing phyf024.tile2.nc .........OK
 Comparing phyf024.tile3.nc .........OK
 Comparing phyf024.tile4.nc .........OK
 Comparing phyf024.tile5.nc .........OK
 Comparing phyf024.tile6.nc .........OK
 Comparing dynf024.tile1.nc .........OK
 Comparing dynf024.tile2.nc .........OK
 Comparing dynf024.tile3.nc .........OK
 Comparing dynf024.tile4.nc .........OK
 Comparing dynf024.tile5.nc .........OK
 Comparing dynf024.tile6.nc .........OK
 Comparing 20130402.000000.out_grd.gwes_30m .........OK
 Comparing 20130402.000000.out_pnt.points .........OK
 Comparing 20130402.000000.restart.gwes_30m .........OK
 Comparing RESTART/coupler.res .........OK
 Comparing RESTART/fv_core.res.nc .........OK
 Comparing RESTART/fv_core.res.tile1.nc .........OK
 Comparing RESTART/fv_core.res.tile2.nc .........OK
 Comparing RESTART/fv_core.res.tile3.nc .........OK
 Comparing RESTART/fv_core.res.tile4.nc .........OK
 Comparing RESTART/fv_core.res.tile5.nc .........OK
 Comparing RESTART/fv_core.res.tile6.nc .........OK
 Comparing RESTART/fv_srf_wnd.res.tile1.nc .........OK
 Comparing RESTART/fv_srf_wnd.res.tile2.nc .........OK
 Comparing RESTART/fv_srf_wnd.res.tile3.nc .........OK
 Comparing RESTART/fv_srf_wnd.res.tile4.nc .........OK
 Comparing RESTART/fv_srf_wnd.res.tile5.nc .........OK
 Comparing RESTART/fv_srf_wnd.res.tile6.nc .........OK
 Comparing RESTART/fv_tracer.res.tile1.nc .........OK
 Comparing RESTART/fv_tracer.res.tile2.nc .........OK
 Comparing RESTART/fv_tracer.res.tile3.nc .........OK
 Comparing RESTART/fv_tracer.res.tile4.nc .........OK
 Comparing RESTART/fv_tracer.res.tile5.nc .........OK
 Comparing RESTART/fv_tracer.res.tile6.nc .........OK
 Comparing RESTART/phy_data.tile1.nc .........OK
 Comparing RESTART/phy_data.tile2.nc .........OK
 Comparing RESTART/phy_data.tile3.nc .........OK
 Comparing RESTART/phy_data.tile4.nc .........OK
 Comparing RESTART/phy_data.tile5.nc .........OK
 Comparing RESTART/phy_data.tile6.nc .........OK
 Comparing RESTART/sfc_data.tile1.nc .........OK
 Comparing RESTART/sfc_data.tile2.nc .........OK
 Comparing RESTART/sfc_data.tile3.nc .........OK
 Comparing RESTART/sfc_data.tile4.nc .........OK
 Comparing RESTART/sfc_data.tile5.nc .........OK
 Comparing RESTART/sfc_data.tile6.nc .........OK
 Comparing RESTART/MOM.res.nc .........OK
 Comparing RESTART/MOM.res_1.nc .........OK
 Comparing RESTART/MOM.res_2.nc .........OK
 Comparing RESTART/MOM.res_3.nc .........OK
 Comparing RESTART/iced.2013-04-02-00000.nc .........OK
 Comparing RESTART/ufs.cpld.cpl.r.2013-04-02-00000.nc ............ALT CHECK......NOT OK
  0: The total amount of wall time                        = 917.045422
Test 025 cpld_bmarkfrac_wave FAIL

This difference is acceptable and due to the Intel compiler optimization. The new regression test baseline date is 20210517.

Issue(s) addressed

See associated PRs listed below.

Testing

Regression tests will be run on all tier-1 platforms prior to committing the chaanges.

  • hera.intel
  • hera.gnu
  • orion.intel
  • cheyenne.intel
  • cheyenne.gnu
  • gaea.intel
  • jet.intel
  • wcoss_cray
  • wcoss_dell_p3

Dependencies

NCAR/ccpp-framework#372
NCAR/ccpp-physics#658
NOAA-EMC/fv3atm#300
#572

MinsukJi-NOAA and others added 6 commits May 12, 2021 14:34
* CI runs when run-ci label is created
* Old method of specifying run-ci in commit message is obsolete
* Push event no longer triggers the CI
* Minor fixes in aux.yml
@climbfuji climbfuji marked this pull request as ready for review May 13, 2021 19:06
@MinsukJi-NOAA
Copy link
Contributor

@climbfuji, #570 has been updated to the latest develop. However you want to incorporate it into your PR... Thanks!

@climbfuji climbfuji mentioned this pull request May 14, 2021
13 tasks
@climbfuji climbfuji added New Input Data Req'd This PR requires new data to be sync across platforms Waiting for Reviews The PR is waiting for reviews from associated component PR's. labels May 14, 2021
@climbfuji climbfuji mentioned this pull request May 14, 2021
13 tasks
@MinsukJi-NOAA
Copy link
Contributor

CI test needs to be skipped because of the CI-related changes in this PR; simply do not add the 'run-ci' label.

@climbfuji
Copy link
Collaborator Author

Machine: gaea
Compiler: intel
Job: RT
Repo location: /lustre/f2/pdata/ncep/emc.nemspara/autort/pr/643540985/20210517224508/ufs-weather-model
Please manually delete: /lustre/f2/scratch/emc.nemspara/FV3_RT/rt_36695
Test fv3_wrtGauss_netcdf_parallel 037 failed in check_result failed
Test fv3_wrtGauss_netcdf_parallel 037 failed in run_test failed
Please make changes and add the following label back:
gaea-intel-RT

This was my stupid mistake, running verify step again.

@DeniseWorthen
Copy link
Collaborator

On Dell, I had two errors creating baselines:

  1. disk quota error (fv3_rrfs_v1alpha)
  2. datm_cdeps_multiple_files_cfsr failed with error [0] Abort with message No such file or directory in file /usrx/local/nceplibs/dev/hpc-stack/src/hpc-stack/pkg/pio 2.5.2/src/clib/pioc_support.c at line 2747

I was able to generate the fv3_rrfs_v1apha baseline after deleting the oldest baseline (develop-20210301). The datm failed the second time with the same error.

Run directory: /gpfs/dell2/ptmp/Denise.Worthen/FV3_RT/rt_25766/datm_cdeps_multiple_files_cfsr
Build directory: /gpfs/dell2/emc/modeling/noscrub/Denise.Worthen/ufs-weather-model

Other than this datm test, the baseline is ready for verify on dell-p3.

@binli2337
Copy link
Contributor

@DeniseWorthen A few new input files needed by this new cdeps test were not on this wcoss-p3. I have copied the new input files to the input-data-20210518 directory. Please try this test again. Thanks!

@climbfuji
Copy link
Collaborator Author

Finally, all tests passed. As per @MinsukJi-NOAA 's instruction, we need to skip the CI tests for this commit. I will merge ccpp-framework and ccpp-physics now.

Copy link
Collaborator

@junwang-noaa junwang-noaa left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Code looks good to me. Need to change the submodules back to official repo.

@climbfuji climbfuji added the Ready for Commit Queue The PR is ready for the Commit Queue. All checkboxes in PR template have been checked. label May 18, 2021
@DeniseWorthen
Copy link
Collaborator

We need to revert the cdeps gitmodule.

@climbfuji
Copy link
Collaborator Author

We need to revert the cdeps gitmodule.

I am confused, this change is from @binli2337 and has been used for all tests?

@DeniseWorthen
Copy link
Collaborator

Yes, you merged it to this PR branch at 2bf1c9e, right?

@climbfuji
Copy link
Collaborator Author

Yes, you merged it to this PR branch at 2bf1c9e, right?

I pulled in #574 without any further changes to CDEPS.

@DeniseWorthen
Copy link
Collaborator

The gitmodules in this PR still points to a @binli2337 branch of CDEPS. That was merged to develop I believe.

@climbfuji
Copy link
Collaborator Author

The gitmodules in this PR still points to a @binli2337 branch of CDEPS. That was merged to develop I believe.

Got it. Let me try to update the submodule pointer, hopefully the code is still the same.

@climbfuji
Copy link
Collaborator Author

The gitmodules in this PR still points to a @binli2337 branch of CDEPS. That was merged to develop I believe.

Got it. Let me try to update the submodule pointer, hopefully the code is still the same.

This should be good now, please check. Code is identical before/after update of submodule pointer.

Copy link
Collaborator

@DeniseWorthen DeniseWorthen left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Looks good to me.

Copy link
Contributor

@binli2337 binli2337 left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Updated files look good.

@climbfuji climbfuji merged commit 19e49ec into ufs-community:develop May 18, 2021
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Baseline Updates Current baselines will be updated. New Input Data Req'd This PR requires new data to be sync across platforms Ready for Commit Queue The PR is ready for the Commit Queue. All checkboxes in PR template have been checked. Waiting for Reviews The PR is waiting for reviews from associated component PR's.
Projects
None yet
Development

Successfully merging this pull request may close these issues.

7 participants