Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add DKRZ as a site to RTW #3837

Open
wants to merge 19 commits into
base: main
Choose a base branch
from
Open

Add DKRZ as a site to RTW #3837

wants to merge 19 commits into from

Conversation

ehogan
Copy link
Contributor

@ehogan ehogan commented Dec 6, 2024

Description


Before you get started

Checklist

It is the responsibility of the author to make sure the pull request is ready to review. The icons indicate whether the item will be subject to the 🛠 Technical or 🧪 Scientific review.


To help with the number of pull requests:

@ehogan ehogan self-assigned this Dec 13, 2024
@ehogan ehogan added the Recipe Test Workflow (RTW) Items relevant to the Recipe Test Workflow label Dec 13, 2024
Comment on lines +41 to +51
* on DKRZ:

* add the following line to your ``~/.bashrc`` file to ensure the Cylc and
Rose executables can be found::

export PATH=/work/bd0854/metomi/bin:$PATH

* run the RTW on DKRZ::

cd ESMValTool/esmvaltool/utils/recipe_test_workflow
cylc vip -O dkrz
Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Here are instructions on how to run the RTW on DKRZ. Rendered version of these instructions are at https://esmvaltool--3837.org.readthedocs.build/en/3837/utils/RTW/user_guide/quick_start.html. Note that some of the process tasks are failing, I assume due to development changes since the last release. I will open new issues for those next week 👍

--wckey = RTW
--account = bk1088
--partition = interactive
--mem = 15G
Copy link
Contributor Author

@ehogan ehogan Dec 13, 2024

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I have sent an e-mail to DKRZ support to ask why the memory needs to be so high (the jobs fail if the memory requested is less than this with messages like slurmstepd: error: Detected 1 oom_kill event in StepId=<process-id>.batch. Some of the step tasks have been OOM Killed.. Actual usage is (apparently) no more than 4GB 😞

@@ -0,0 +1,8 @@
[command]
default=singularity-env singularity build ${CONTAINER_PATH} ${DOCKER_SOURCE}
Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Singularity works on DKRZ! Yay! 🎉

@@ -2,7 +2,8 @@
[runtime]
[[root]]
[[[environment]]]
# Warning: fragile assumption of location of esmvaltool code, see #3437
# Warning: fragile assumption of location of esmvaltool code,
# issue #3437 contains more details.
Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Adding link to issue here: #3437

@ehogan ehogan requested a review from sloosvel December 13, 2024 18:08
@ehogan ehogan marked this pull request as ready for review December 13, 2024 18:09
@ehogan
Copy link
Contributor Author

ehogan commented Dec 13, 2024

@sloosvel this branch should enable you to run the RTW on DKRZ! 🎉 The reason for the delay was because I was implementing the use of a container (I was using the "clone" method at the workshop in November), and I made a few rookie mistakes 😖

I have made some comments on the PR that I hope will help you with your review 😊

Note that remove_preproc_dir is set to False by default, which causes some of the compare failures. I mentioned in one of my comments that I plan to open new issues to deal with the process failures, and I would also be happy to open more new issues to deal with the compare failures 👍 unless you'd rather fix them here?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Recipe Test Workflow (RTW) Items relevant to the Recipe Test Workflow
Projects
None yet
Development

Successfully merging this pull request may close these issues.

Add DKRZ as a site to RTW
1 participant