Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Spack-stack moved from /lfs4 to /contrib on Jet #977

Closed
InnocentSouopgui-NOAA opened this issue Aug 28, 2024 · 8 comments · Fixed by #981
Closed

Spack-stack moved from /lfs4 to /contrib on Jet #977

InnocentSouopgui-NOAA opened this issue Aug 28, 2024 · 8 comments · Fixed by #981
Assignees
Labels
maintenance Basic upkeep

Comments

@InnocentSouopgui-NOAA
Copy link
Contributor

InnocentSouopgui-NOAA commented Aug 28, 2024

Spack stack moved from the storage /lfs4 to a new storage space /contrib, following the failure of /lfs4.

The requires the update of module files for all softwares supported on Jet

Refs NOAA-EMC/global-workflow#2377

@InnocentSouopgui-NOAA
Copy link
Contributor Author

@InnocentSouopgui-NOAA is working on this issue

@InnocentSouopgui-NOAA
Copy link
Contributor Author

ice_blend consistency test is failing on segfault while running copygb2

+ /contrib/spack-stack/spack-stack-1.6.0/envs/unified-env-rocky8/install/intel/2021.5.0/grib-util-1.3.0-74mdurc/bin/copygb2 -x -i3 -g '0 0 0 0 0 0 0 0 4320 2160 0 0 89958000 42000 48 -89958000 359958000 83000 83000 0' ims.icec.grib2 ims.icec.5min.grib2
forrtl: severe (174): SIGSEGV, segmentation fault occurred
Image              PC                Routine            Line        Source             
copygb2            00000000004D67BA  Unknown               Unknown  Unknown
libpthread-2.28.s  00007FA269B92CF0  Unknown               Unknown  Unknown
copygb2            000000000040A85E  Unknown               Unknown  Unknown
copygb2            00000000004125EC  Unknown               Unknown  Unknown
copygb2            0000000000408377  Unknown               Unknown  Unknown
copygb2            0000000000407122  Unknown               Unknown  Unknown
libc-2.28.so       00007FA2697F5D85  __libc_start_main     Unknown  Unknown
copygb2            000000000040702E  Unknown               Unknown  Unknown

@GeorgeGayno-NOAA
Copy link
Collaborator

ice_blend consistency test is failing on segfault while running copygb2

+ /contrib/spack-stack/spack-stack-1.6.0/envs/unified-env-rocky8/install/intel/2021.5.0/grib-util-1.3.0-74mdurc/bin/copygb2 -x -i3 -g '0 0 0 0 0 0 0 0 4320 2160 0 0 89958000 42000 48 -89958000 359958000 83000 83000 0' ims.icec.grib2 ims.icec.5min.grib2
forrtl: severe (174): SIGSEGV, segmentation fault occurred
Image              PC                Routine            Line        Source             
copygb2            00000000004D67BA  Unknown               Unknown  Unknown
libpthread-2.28.s  00007FA269B92CF0  Unknown               Unknown  Unknown
copygb2            000000000040A85E  Unknown               Unknown  Unknown
copygb2            00000000004125EC  Unknown               Unknown  Unknown
copygb2            0000000000408377  Unknown               Unknown  Unknown
copygb2            0000000000407122  Unknown               Unknown  Unknown
libc-2.28.so       00007FA2697F5D85  __libc_start_main     Unknown  Unknown
copygb2            000000000040702E  Unknown               Unknown  Unknown

Can you check in your changes to your branch so I can run exactly what you are?

How are you invoking the driver script?

@InnocentSouopgui-NOAA
Copy link
Contributor Author

InnocentSouopgui-NOAA commented Sep 6, 2024

I pushed all my updates to https://github.com/InnocentSouopgui-NOAA/UFS_UTILS/tree/migration-jet-contrib

To run the ice_blend test

  1. set the account for the job inside the script
  2. set Work directory export WORK_DIR=/lfs5/NESDIS/nesdis-rdo2/Innocent.Souopgui/stmp
  3. cal the script from inside the directory reg_tests/ice_blend as:
    ./driver.jet.sh

All other consistency tests were successful.

@InnocentSouopgui-NOAA
Copy link
Contributor Author

@GeorgeGayno-NOAA
Have you had the chance to look at this problem again?

In case it takes longer, I suggest we open a separate issue for this problem and move forward to deliver Global workflow on Jet?
People who run global Workflow on Jet are not able to run since the crash of lfs4.

What are your thoughts?

@GeorgeGayno-NOAA
Copy link
Collaborator

I pushed all my updates to https://github.com/InnocentSouopgui-NOAA/UFS_UTILS/tree/migration-jet-contrib

To run the ice_blend test

  1. set the account for the job inside the script
  2. set Work directory export WORK_DIR=/lfs5/NESDIS/nesdis-rdo2/Innocent.Souopgui/stmp
  3. cal the script from inside the directory reg_tests/ice_blend as:
    ./driver.jet.sh

All other consistency tests were successful.

Oh. Step 3 might be the problem. Try sbatch driver.jet.sh.

@InnocentSouopgui-NOAA
Copy link
Contributor Author

Thank you,
I did not pay full attention that this script calling sequence was different from others.

Everything works.
Let's move forward with the pull request #978

@GeorgeGayno-NOAA
Copy link
Collaborator

A few minor bugs need to be fixed. Reopening.

GeorgeGayno-NOAA added a commit to GeorgeGayno-NOAA/UFS_UTILS that referenced this issue Sep 9, 2024
GeorgeGayno-NOAA added a commit to GeorgeGayno-NOAA/UFS_UTILS that referenced this issue Sep 9, 2024
GeorgeGayno-NOAA added a commit to GeorgeGayno-NOAA/UFS_UTILS that referenced this issue Sep 10, 2024
by loading the grib-util module in the ice_blend regression
test script.

Fixes ufs-community#977.
GeorgeGayno-NOAA added a commit to GeorgeGayno-NOAA/UFS_UTILS that referenced this issue Sep 10, 2024
@GeorgeGayno-NOAA GeorgeGayno-NOAA mentioned this issue Sep 10, 2024
2 tasks
WalterKolczynski-NOAA pushed a commit to NOAA-EMC/global-workflow that referenced this issue Sep 30, 2024
Migrates Global Workflow to use contrib installation of spack-stack on
Jet.
Following the failure of the storage /lfs4 on Jet, the installation of
spack spack moved to /contrib.
All softwares relying on spack-stack on Jet needs update.

Resolves #2841 
Refs NOAA-EMC/gfs-utils#78
Refs NOAA-EMC/GSI#786
Refs NOAA-EMC/GSI-Monitor#143
Refs NOAA-EMC/GSI-utils#51
Refs ufs-community/UFS_UTILS#977
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
maintenance Basic upkeep
Projects
Status: Done
Development

Successfully merging a pull request may close this issue.

2 participants