-
Notifications
You must be signed in to change notification settings - Fork 177
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
gldas fails with USE_CFP=YES on Hera #1089
Comments
I suspect the command file (also?) needs to have the CPU indices prepended to each line. This is likely an issue on other slurm machines as well. |
|
This should definitely be set to NO. This is an oversight on my part. Will remedy in a PR momentarily. |
Expected behavior
gdasgldas should run to completion with
USE_CFP=YES
Current behavior
When gdasgldas runs on Hera with
USE_CFP=YES
, it fails withMachines affected
Hera
To Reproduce
Additional Information
The Hera gdasgldas job log file indicates that CFP failed because the command file contains four entries but srun was invoked with 84 tasks. This, apparently, causes a problem on Hera. A check of operational gdasgldas log files on WCOSS2 show that CFP on WCOSS2 is OK with specifying more tasks than entries in the command file.
Possible Implementation
If it is true that on Hera the number of tasks must equal the number of entries in the command file, the gldas script(s) invoking CFP can count the number of entries in command files and execute CFP specifying that number of tasks.
For the time being, I changed
USE_CFP="YES"
in the gldas section ofHERA.env
toUSE_CFP="NO"
. The Hera gldas job runs to completion with this change.The text was updated successfully, but these errors were encountered: