Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Update ufs-weather for latest CICE emc/develop; Update to esmf-bs47; add export of openwater_frac_in_atm from fv3ATM when coupled #475

Merged
merged 83 commits into from
Apr 5, 2021

Conversation

DeniseWorthen
Copy link
Collaborator

@DeniseWorthen DeniseWorthen commented Mar 14, 2021

Description

This PR will:

Issue(s) addressed

Testing

  • This PR will change all ufs-coupled and ufs-datm baselines due to a non-b4b bugfix in Icepack. A test baseline was created on hera.intel and all cpld and datm regression tests passed against that test baseline.

  • This PR also changes baselines due to the change to esmf-bs47 because some forecast files will contain different metadata or different ordering of fields. See fv3ATM Issue #265 for more details on the changes expected.

  • hera.intel
  • hera.gnu
  • orion.intel
  • cheyenne.intel
  • cheyenne.gnu
  • gaea.intel
  • jet.intel
  • wcoss_cray
  • wcoss_dell_p3

Dependencies

NEMS PR#94
CICE PR #24
CMEPS PR #39
FV3-ATM PR #266

DeniseWorthen and others added 30 commits October 28, 2020 12:47
use updcmeps branch of CMEPS-interface/CMEPS
add file to CMakeLists
shorten name of nems.configure file for coupled model
clean up white space
add rahul's fix for optionally loading fv3_debug if exists and
debug=y
add file to cice_files.cmake
update CICE with use statement added to CICEInitMod
add regression test log showing that debug test passes baseline
cpld_control fails due to refactor
* Add $INPUTDATA_ROOT
* Update utest
* rt.sh: Remove LOCKDIR before trying to stop ecflow in cleanup()
@DeniseWorthen
Copy link
Collaborator Author

@junwang-noaa all the submodules have approvals at this point except CICE--can you please review/approve?

@DeniseWorthen
Copy link
Collaborator Author

I put the jet baseline in the wrong location for the auto-rt to find it (/lfs4/HFIP/hfv3gfs/RT/NEMSfv3gfs vs /lfs4/HFIP/hfv3gfs/emc.nemspara/RT/NEMSfv3gfs) so we'll be restarting the jet auto-rt.

@BrianCurtis-NOAA
Copy link
Collaborator

Machine: jet
Compiler: intel
Repo location: /lfs4/HFIP/h-nems/emc.nemspara/autort/pr/592556664/20210402160009/ufs-weather-model
Please manually delete: /lfs4/HFIP/hfv3gfs/emc.nemspara/RT_RUNDIRS/emc.nemspara/FV3_RT/rt_18641
Test fv3_stretched_nest 019 failed in run_test failed
Test fv3_HAFS_v0_hwrf_thompson 046 failed in check_result failed
Test fv3_HAFS_v0_hwrf_thompson 046 failed in run_test failed
Please make changes and add the following label back:
jet-intel-RT

@DeniseWorthen
Copy link
Collaborator Author

DeniseWorthen commented Apr 2, 2021

The fv3_stretched_nest failed due to time wall clock but it actually seemed to have hung. It is a 48hr forecast and it looks like it only got a few timesteps.

The HAFS test failed on the 4x daily tile2 because the baseline file has no time level information written:

time = UNLIMITED ; // (0 currently)

The other 4x daily files for the other tiles have 4 timesteps correctly written and they compare identically with the verify run.

I will add that I'm having some trouble getting response from jet when I ls or cd. So perhaps there is a system issue going on here too.

@DeniseWorthen
Copy link
Collaborator Author

The baselines on jet are not completing (compiling) because the disk is full:

CMakeFiles/ccppphys.dir/physics/GFS_phys_time_vary.fv3.F90.o: catastrophic error: Disk is full, no space to write object file

This is in my own attempt to run the last two baselines as well as the auto-rt I started late yesterday.

@BrianCurtis-NOAA
Copy link
Collaborator

Machine: jet
Compiler: intel
Repo location: /lfs4/HFIP/h-nems/emc.nemspara/autort/pr/592556664/20210402230015/ufs-weather-model
Please manually delete: /lfs4/HFIP/hfv3gfs/emc.nemspara/RT_RUNDIRS/emc.nemspara/FV3_RT/rt_150647
Test compile 014 failed failed
Test compile 013 failed failed
Test fv3_cpt 032 failed in run_test failed
Test fv3_wrtGauss_nemsio_c192 010 failed in run_test failed
Test fv3_gfs_v16_RRTMGP_2thrd 041 failed in run_test failed
Test fv3_esg_HAFS_v0_hwrf_thompson 047 failed in run_test failed
Please make changes and add the following label back:
jet-intel-RT

* update CICE, CMEPS and NEMS submodules
@DeniseWorthen
Copy link
Collaborator Author

Ready for review & merge.

@@ -40,7 +40,7 @@ RUN | fv3_stretched_nest
COMPILE | APP=ATM SUITES=FV3_GFS_v15_thompson_mynn 32BIT=Y | | fv3 |
RUN | fv3_regional_control | | fv3 |
RUN | fv3_regional_restart | | fv3 | fv3_regional_control
RUN | fv3_regional_quilt | | fv3 |
#RUN | fv3_regional_quilt | | fv3 |
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Do we have a timeline for fixing this?

Copy link
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I think Jun has tracked down the issues here. She's opened an associated issue in EMC_post.

@@ -56,6 +56,7 @@ export LIST_FILES="atmos_4xdaily.tile1.nc \
export_fv3

export TASKS=192
export WLCLK=60
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

These longer runtimes need to be investigated.

Copy link
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

A full 60 minutes is not required but this is what I tested out of frustration. The runs ended up taking about 20-25 minutes in my case.

@MinsukJi-NOAA
Copy link
Contributor

@DeniseWorthen could you include these changes? It fixes the esmf bs47 docker container issue.

diff --git a/tests/ci/Dockerfile b/tests/ci/Dockerfile
index c685eaf..8d01299 100644
--- a/tests/ci/Dockerfile
+++ b/tests/ci/Dockerfile
@@ -1,4 +1,4 @@
-From noaaemc/ubuntu-hpc:v1.1
+From noaaemc/ubuntu-hpc:v1.2
 
 ENV HOME=/home/builder
 COPY --chown=builder:builder . $HOME/ufs-weather-model

@DeniseWorthen
Copy link
Collaborator Author

@MinsukJi-NOAA Should I now try run-ci?

@MinsukJi-NOAA
Copy link
Contributor

Yes, that would be great.

@DeniseWorthen
Copy link
Collaborator Author

@MinsukJi-NOAA Two CI tests failed. Should I hold off merging until you can take a look?

@MinsukJi-NOAA
Copy link
Contributor

I will take a look. I suggest you go ahead with merging for this PR.

@DeniseWorthen DeniseWorthen merged commit b8ed338 into ufs-community:develop Apr 5, 2021
pjpegion pushed a commit to NOAA-PSL/ufs-weather-model that referenced this pull request Apr 4, 2023
…(combined) (ufs-community#475)

* add lsm_cold_start variable in GFS_typedefs.F90/meta and update ccpp/physics for testing
* includes @SMoorthi-emc 's bugfix for gwdps.f.
epic-cicd-jenkins pushed a commit that referenced this pull request Apr 17, 2023
…e-offset (#475)

* change PDY to yyyymmdd for time-offset

* change prod in nco we2e script

* modify ex-script rather than machine file

* fix typo

* Set PDYext cycext

Co-authored-by: chan-hoo <chan-hoo.jeon@clogin01.cactus.wcoss2.ncep.noaa.gov>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Baseline Updates Current baselines will be updated. Waiting for Reviews The PR is waiting for reviews from associated component PR's.
Projects
None yet
Development

Successfully merging this pull request may close these issues.

6 participants