Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[develop:] update hpc-stack location for Gaea, miniconda3, Lmod initialization, Lua modulefiles #1617

Closed

Conversation

natalie-perlin
Copy link
Collaborator

Description

Location of the HPC-stack updated for Gaea, Lmod 8.7.12 initialization and miniconda3 loading for Gaea, transition to use of lua modulefiles. A correction to the use of Lmod for loading modulefiles for RT setup. All regression tests pass.

This PR addresses the following issues:

  1. Software stack and miniconda3 location are updated for Gaea, which started in Issue-1465, resolved for all other Tier-1 platforms in RT-1596. An updated location for Gaea is /lustre/f2/dev/role.epic/contrib/hpc-stack/intel-2021.3.0_noarch/modulefiles/stack

  2. Lmod 8.7.12 initialization and module loading is properly set for Gaea. Modulefiles are transitioned to lua format.

  3. A working directory for the RT now creates a separate directory for modulefiles, and is not mixed with all the input/output and setup data. It otherwise creates a problem for Lmod when a large number of non-modulefiles are found in a directory that is stored as $MODULEPATH environmental (added to MODULEPATH by "module use ").

All the Regression Tests pass on Gaea successfully, as could be viewed in
/lustre/f2/scratch/ncep/Natalie.Perlin/UFS-WM/ufs-wm-hpcepic/tests
/lustre/f2/scratch/role.epic/FV3_RT/rt_14526/

-->

Top of commit queue on: TBD

Input data additions/changes

  • No changes are expected to input data.

Anticipated changes to regression tests:

  • No changes are expected to any regression test.
  • Changes are expected to the following tests:

Subcomponents involved:

  • AQM
  • CDEPS
  • CICE
  • CMEPS
  • CMakeModules
  • FV3
  • GOCART
  • HYCOM
  • MOM6
  • NOAHMP
  • WW3
  • stochastic_physics
  • none

Combined with PR's (If Applicable):

Commit Queue Checklist:

  • Link PR's from all sub-components involved
  • Confirm reviews completed in sub-component PR's
  • Add all appropriate labels to this PR.
  • Run full RT suite on either Hera/Cheyenne with both Intel/GNU compilers
  • Add list of any failed regression tests to "Anticipated changes to regression tests" section.

Linked PR's and Issues:

Testing Day Checklist:

  • This PR is up-to-date with the top of all sub-component repositories except for those sub-components which are the subject of this PR.
  • Move new/updated input data on RDHPCS Hera and propagate input data changes to all supported systems.

Testing Log (for CM's):

  • RDHPCS
    • Intel
      • Hera
      • Orion
      • Jet
      • Gaea: /lustre/f2/scratch/ncep/Natalie.Perlin/UFS-WM/ufs-wm-hpcepic/tests/RegressionTests_gaea.intel.log
      • Cheyenne
    • GNU
      • Hera
      • Cheyenne
  • WCOSS2
    • Dogwood/Cactus
    • Acorn
  • CI
    • Completed
  • opnReqTest

@natalie-perlin natalie-perlin changed the title [develop:] update hpc-stack location for Gaea, miniconda3, Lmod initialization [develop:] update hpc-stack location for Gaea, miniconda3, Lmod initialization, Lua modulefiles Feb 17, 2023
@jkbk2004 jkbk2004 mentioned this pull request Mar 6, 2023
16 tasks
@jkbk2004
Copy link
Collaborator

jkbk2004 commented Mar 6, 2023

@natalie-perlin if we use lua on gaea, do we need to keep ufs_gaea.intel and ufs_gaea.intel_debug?

@jkbk2004
Copy link
Collaborator

jkbk2004 commented Mar 7, 2023

@natalie-perlin Is it ok to remove ufs_gaea.intel and ufs_gaea.intel_debug ?

@natalie-perlin
Copy link
Collaborator Author

@jkbk2004 - yes, they have to be removed!

@natalie-perlin
Copy link
Collaborator Author

@jkbk2004 -
ufs_gaea.intel and ufs_gaea.intel_debug need to be removed to avoid ambiguity of similarly named TCL and Lmod modulefiles in the same directory.

@zach1221
Copy link
Collaborator

zach1221 commented Mar 7, 2023

@natalie-perlin would you be able to answer Jong's question above and create PR to merge into Cameron's #1640 ?

@jkbk2004
Copy link
Collaborator

jkbk2004 commented Mar 7, 2023

@natalie-perlin then clean up those unnecessary files, please!

@natalie-perlin
Copy link
Collaborator Author

@natalie-perlin then clean up those unnecessary files, please!

The files ufs_gaea.intel and ufs_gaea.intel_debug have been already removed in this PR!

@jkbk2004
Copy link
Collaborator

jkbk2004 commented Mar 7, 2023

@natalie-perlin then clean up those unnecessary files, please!

The files ufs_gaea.intel and ufs_gaea.intel_debug have been already removed in this PR!

Oh! sorry about that. I was confused.

Copy link
Collaborator

@SamuelTrahanNOAA SamuelTrahanNOAA left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Also needs to be tested with Rocoto.

I'm working on these things now in the combined PR #1645

@@ -15,7 +15,7 @@ echo -n " $( date +%s )," > job_timestamp.txt
cd $PBS_O_WORKDIR

set +x
module use $( pwd -P )
module use modulefiles
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

These should be absolute paths so that they'll still work if the directory changes.

Suggested change
module use modulefiles
module use $PWD/modulefiles

@@ -16,7 +16,7 @@ echo -n " $( date +%s )," > job_timestamp.txt
set +x
MACHINE_ID=hera
source ./module-setup.sh
module use $( pwd -P )
module use modulefiles
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This should use an absolute path so it'll still work if the directory changes.

Suggested change
module use modulefiles
module use $PWD/modulefiles

@@ -17,7 +17,7 @@ echo -n " $( date +%s )," > job_timestamp.txt
set +x
MACHINE_ID=jet
source ./module-setup.sh
module use $( pwd -P )
module use modulefiles
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This should use an absolute path so it'll still work if the directory changes.

Suggested change
module use modulefiles
module use $PWD/modulefiles

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

4 participants