Various CCPP PRs. Improve and update CI. #541
* Better ec2 instance control in aux.yaml
* Automatically check if PR branch is up to date with authoritative repos
* Build/test matrix change to allow more than one test type in the future
* Add cleanup of docker container/image before start testing
* Delete files related to skip-ci and automatic cancellation of jobs
* run-ci
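For readers unfamiliar with the up-to-date check mentioned in the commit list above, here is a minimal sketch of one way to do it. It is not the content of repo_check.sh, which is not shown in this thread. It assumes the authoritative branch has already been fetched, and the function name `is_up_to_date`, its `run` parameter, and the ref name `upstream/develop` in the usage note are hypothetical. It relies on `git merge-base --is-ancestor`, which exits with 0 when the first commit is an ancestor of the second and with 1 when it is not.

```python
import subprocess


def is_up_to_date(upstream_ref, head_ref="HEAD", run=subprocess.run):
    """Return True if head_ref already contains every commit of upstream_ref.

    Hypothetical helper: `run` is injectable so the check can be exercised
    without a real repository.
    """
    # Exit code 0: upstream_ref is an ancestor of head_ref (branch is current).
    # Exit code 1: it is not (branch is behind the authoritative repository).
    result = run(["git", "merge-base", "--is-ancestor", upstream_ref, head_ref])
    if result.returncode == 0:
        return True
    if result.returncode == 1:
        return False
    # Any other exit code signals a git error, for example an unknown ref.
    raise RuntimeError(f"git merge-base failed with exit code {result.returncode}")
```

Calling `is_up_to_date("upstream/develop")` inside a checkout would then tell the CI whether the PR branch needs to be updated before testing starts.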
Are you doing something unique to check for
a quick glance at the python script:
* Add job dependency in aux.yml
* Rename repo-check.sh to repo_check.sh
* run-ci
I see unlimited
CI jobs will have to be canceled manually on a case-by-case basis. It is hard to set a time limit that stops the loops automatically, because it all depends on the number and status of jobs in the queue and in progress. If a job is not stopped manually, it will be cut off by the time limit of an Actions job, which is 6 hours by default. I guess I could put the six hours in the script. If you have a suggestion, please let me know.
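A minimal sketch of what putting the six hours in the script could look like, assuming the script polls the job queue in a loop. The names `wait_for_jobs` and `jobs_pending` are hypothetical and not taken from the actual CI script; the clock and sleep functions are parameters only so that the loop can be tested without waiting.

```python
import time

# Default time limit of an Actions job mentioned above, in seconds.
SIX_HOURS = 6 * 60 * 60


def wait_for_jobs(jobs_pending, timeout_s=SIX_HOURS, interval_s=60,
                  clock=time.monotonic, sleep=time.sleep):
    """Poll until jobs_pending() returns False or the deadline passes.

    Returns True if the jobs finished, False if the time limit was reached.
    """
    deadline = clock() + timeout_s
    while jobs_pending():
        if clock() >= deadline:
            # Give up instead of looping until the runner kills the job.
            return False
        sleep(interval_s)
    return True
```

A caller could exit with a non-zero status when this returns `False`, so that a stuck queue shows up as a failed job rather than one that silently runs into the platform limit.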
I don't know at the moment; I have not thought about this.
I agree it's not a perfect system. I have some ideas, which I will experiment with, test, and implement in the future.
@DusanJovic-NOAA, @BrianCurtis-NOAA, @aerorahul, could you please help me run one more test by forking my repository (MinsukJi-NOAA:feature/CI-update) and making 2 pull requests per person? The 1st pull request should have run-ci in the commit message, and the 2nd should not. Thanks.
@MinsukJi-NOAA
Thanks for testing, @DusanJovic-NOAA. It was successful: https://github.com/MinsukJi-NOAA/ufs-weather-model/actions
* Revert back parameters used to test in a personal repository
* Change number of instances from 2 (testing) to 6 (ufs-weather-model)
This PR will not pass the CI test because it contains changes to the CI test scripts.
I'm just curious: if the CI changes are in this PR, does it not use these files when it launches the CI?
The workflow triggered by pull_request (build_test.yml) will use the new file. However, the workflow it triggers (aux.yml) uses the file in the base repository. This is for security reasons, as it has write permission to the repository as well as access to secrets. The CI test does not work because the two workflows work in tandem.
Thanks for the info. Much appreciated.
Since the CI tests cannot pass, as @MinsukJi-NOAA explained, and all regression tests passed except orion.intel (orion was under maintenance for most of the day), we can consider these PRs ready to merge. I did start auto-bl on orion, and given how late it is, we can just wait until tomorrow (Tuesday) morning to do the merges.
Machine: orion
I checked the orion log: one test failed while creating the baseline due to an MPI timeout (a system issue, as so often on orion). Let's proceed with the merge; in the meantime I will try to complete this manually on orion.
CCPP pointers are not updated. Let me try again.
You need to update your fv3atm submodule; it needs to point to hash 28888f0 in the NOAA-EMC repository.
OK. It's done and ready for merge.
fv3atm submodule pointer is correct
@DusanJovic-NOAA please review and merge if ok - thank you!
Note. I was able to create the 20210427 baseline for this PR on Orion retrospectively and I started to verify against it, but then the
* Remove echo from script
* Add path to result files
* Add a new WE2E test for inline post in nco mode
PR Checklist
This PR is up-to-date with the top of all sub-component repositories except for those sub-components which are the subject of this PR. Please consult the ufs-weather-model wiki if you are unsure how to do this.
This PR has been tested using a branch which is up-to-date with the top of all sub-component repositories except for those sub-components which are the subject of this PR
An Issue describing the work contained in this PR has been created either in the subcomponent(s) or in the ufs-weather-model. The Issue should be created in the repository that is most relevant to the changes contained in the PR. The Issue and the dependent sub-component PR are specified below.
If new or updated input data is required by this PR, it is clearly stated in the text of the PR.
Instructions: All subsequent sections of text should be filled in as appropriate.
The information provided below allows the code managers to understand the changes relevant to this PR, whether those changes are in the ufs-weather-model repository or in a subcomponent repository. The ufs-weather-model code managers will use the information provided to add any applicable labels, assign reviewers, and place the PR in the Commit Queue. Once the PR is in the Commit Queue, it is the PR owner's responsibility to keep the PR up-to-date with the develop branch of ufs-weather-model.
Description
This PR includes:
Issue(s) addressed
NCAR/ccpp-physics#641
#539
Testing
Regression testing on tier-1 platforms using the newly generated baselines:
Testing of the actions workflow runs can be seen here and here.
Note also that @climbfuji compared the new baselines (20210427) on Hera for Intel and GNU to the previous ones (20210426) for the final PR. Only the tests described in NOAA-EMC/fv3atm#291 (comment) are different, as expected.
Dependencies