-
Notifications
You must be signed in to change notification settings - Fork 121
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[RELEASE] Fixes for gaea, noaacloud, and miniconda updates #444
[RELEASE] Fixes for gaea, noaacloud, and miniconda updates #444
Conversation
remove loading of system python3
remove loading system python module
Load updated miniconda3 and ask to activate regional_workflow enviroment
Load an updated miniconda3 and ask to activate regional_workflow environment
Update miniconda3 module location and ask to activate regional_workflow
Update miniconda3/4.12.0 module location and ask to activate regional_workflow environment
Run an additional cycle of "conda deactivate" and "conda activate regional_workflow". It ensures that _python3_ binary path from the *regional_workflow* environment becomes prepended to the search $PATH, and is found first, before the _python3_ from miniconda3/4.12.0 from the *base* environment.
"conda activate regional_workfow"
…orkflow.lua use new miniconda3/4.12.0 with regional_workflow environment
all the requested packages for the python3 are found in regional_workflow environment
load updated miniconda3/4.12.0 with regional_workflow environment
Load an updated miniconda3/4.12.0 with the regional_workflow environment
need to have miniconda3 loaded in build module
need to have miniconda3 loaded in the build module
need to have miniconda3 loaded in build module
need to have miniconda3 loaded in build module
need to have miniconda3 loaded in the build modulefile
need to have miniconda3 loaded in build modulefile
@danielabdi-noaa , @mark-a-potts - Also, the script load_modules_run_task.sh loads the |
@mark-a-potts Both manual and Jenkins CI tests are failing on Orion and Cheyenne. The Jenkins CI pipeline failed to connect, causing the failure on Jenkins. For the manual run on Orion, I see the following:
The message in log/make_orog.log:
For Cheyenne:
Please see https://jenkins-epic.woc.noaa.gov/blue/organizations/jenkins/ufs-srweather-app%2Fpipeline/detail/PR-444/2/pipeline for the Jenkins CI pipeline associated with this work. All tests on Hera, Jet, and Gaea have successfully passed, but there are issues on both Cheyenne and Orion. |
@mark-a-potts Orion build modulefile also misses python3 loading. |
A new issue submitted: #446 Discussing the option to retire the use of *local.lua files. If miniconda3 is loaded in build_ |
@MichaelLueken - the make_orog.local.lua file is missing on Orion, Hera, Jet... The SRW_ENV variable is not set, and therefore regional_environment is not activated. It used to be needed when different conda environments were needed by different tasks (e.g., pygraf environment for plotting). Now all of tasks use |
@natalie-perlin Do you know why the WE2E tests are behaving correctly on Hera and Jet? Is it a case that the |
@mark-a-potts On Cheyenne, when I updated to aa0bb86, the WE2E tests are running properly. Further, I made two changes on Orion:
Following these two modifications, the WE2E are now running there as well. I'll let you know if I encounter any failures with these modifications in my manual runs. |
@MichaelLueken - |
@natalie-perlin The Jenkins CI tests for that pipeline on Jet has passed and Hera has a SUCCESS for each test with the final pass coming within the next ten minutes. The issue now is only with Orion, but a manual run with the changes I've outlined above is currently running without issue. |
I just pushed changes with the added files for Orion. My manual test there ran to completion. |
I don't think we need to add the python module for Orion. The default system python3 is 3.7.5, which should be new enough for the build step of the WM. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
The manual WE2E tests are currently running on Orion with the modifications I have laid out in this review. I'll rerun the Orion and Cheyenne tests in Jenkins, then this work can be merged.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Since @mark-a-potts has successfully ran the modified files on Orion, I will go ahead and approve these changes for now. The Jenkins CI pipeline has been resubmitted for Orion. Once the tests pass, these changes will be ready to be merged.
@@ -8,8 +8,8 @@ whatis([===[Loads libraries needed for running SRW on Orion ]===]) | |||
load("contrib") | |||
load("rocoto") | |||
|
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I did not end up putting a python module into the Orion module files, so there is no python to unload.
Successfully ran a single test with the PR-444 on Cheyenne, Gaea, Hera, Jet. UPD.: Orion test completed successfully |
Yes, it works now! |
DESCRIPTION OF CHANGES:
This PR combines fixes for the noaacloud, @natalie-perlin's miniconda updates, and fixes for the conda activate/deactivate/reactivate issue observed on various platforms.
Type of change
TESTS CONDUCTED:
DOCUMENTATION:
This PR should not require any documentation changes.
ISSUE:
CHECKLIST
LABELS (optional):
A Code Manager needs to add the following labels to this PR:
CONTRIBUTORS (optional):
@natalie-perlin @ulmonian