use reframe features + fixes for various issues #11
Conversation
I tested this (but only the default behaviour, no command-line overriding yet). One thing I noticed is that the default behaviour is different from the master branch. Given two GROMACS modules,
This PR only runs combinations 1 & 4 by default. Of course, we might wonder if running 2 & 3 is particularly useful. I'd argue yes, for the following reasons:
It's easy enough to adapt this PR to that same default behaviour, but I'm curious to know if you agree with my reasoning above :)

Edit: I see that it actually runs 1, 3, and 4 by default. I was confused by the listing, which only showed two checks:
But of course the checks for the individual partitions are only generated later, so when I run, I get:
Since we also briefly discussed how to reduce the number of GPUs used in the tests: I do indeed think that this should be done in the ReFrame config file. Simply specify a virtual ReFrame partition that only has 1 GPU per node. E.g. we have GPU nodes with 4 GPUs per node. I would then specify e.g.:
if I wanted the test to use the full GPU nodes, or
to 'fake' a partition with smaller nodes. In this example, one 'node' in this virtual partition would simply be a quarter of our real node. I tested this approach for single-node tests, and there it works fine. I think this approach will even work for multi-node tests, but I cannot test it: our system only allows partial node allocation for single-node jobs; multi-node jobs have to be exclusive.
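As an illustration of this idea, here is a minimal sketch of what the two partition definitions could look like in a ReFrame site config. The partition names, scheduler, launcher and access options are assumptions for illustration only, not taken from an actual config:

```python
# Hypothetical excerpt of the 'partitions' list of a system in a ReFrame site config.
'partitions': [
    {
        # real GPU partition: tests see full nodes with 4 GPUs each
        'name': 'gpu',
        'scheduler': 'slurm',
        'launcher': 'mpirun',
        'access': ['--partition=gpu'],
        'features': ['gpu'],
        'devices': [{'type': 'gpu', 'num_devices': 4}],
    },
    {
        # virtual partition: a 'node' here is a quarter of a real GPU node,
        # so tests running on it only ever request 1 GPU
        'name': 'gpu_quarter_node',
        'scheduler': 'slurm',
        'launcher': 'mpirun',
        'access': ['--partition=gpu'],
        'features': ['gpu'],
        'devices': [{'type': 'gpu', 'num_devices': 1}],
    },
],
```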
Ok, so I also tested the
That generated the job:
Which looks fine.
thanks for the review. this is indeed a change I forgot to discuss with you. on the other hand, users (hopefully) rarely do this (unless for benchmarking), and we actively discourage it at VUB. I changed the code so you can now choose the behavior per partition in the config file, see 0f77a7a
what do you think of this solution?
yes sure. it would still be nice to also have a cmd line option for this, but that can be added later.
Yeah, makes a lot of sense to me. I like it if this behaviour can be controlled from the config side, as it means you can 'flip' that switch in one go for all tests. I've noticed I'm thinking very much about 'what do we want in the EESSI CI?' (probably: test everything), so it's super useful that you also provide the point of view of 'what would an end-user/sysadmin from an HPC system want?' (i.e. more limited testing to save resources) :) I'll give this another try to check the latest changes.
I refactored the logic a bit for the set of default test combinations. As I did so, I found out I actually forgot one combination in my listing above:

Why would we want to test this? It could be that for certain versions of GROMACS in the EESSI stack, we only have a CUDA-aware build (simply because no one bothered to do the non-CUDA-aware build). If a researcher wants to use that particular version, but only has access to a CPU machine, that's perfectly fine and should work. Therefore, it should be tested. So, all in all, on a system with 2 modules (one CUDA-aware, one non-CUDA-aware) and 2 partitions (one with GPUs, one without), you'd get five tests by default:
I'll do a PR with my changed logic to your branch. I also stripped out the 'skip' hooks, as those are no longer needed now that we use the features for this :)
yeah, you're probably right.
i'm ok with it being the default, but I also want an easy way to filter those cases out without having to manually specify every combination that I do want to test. i have another idea that could work. i'll first merge your PR into mine, and then add my idea on top.
@casparvl i've kept the logic that you proposed and added 3 optional variables that users can set on the cmd line: this ensures that all valid combinations are tested by default, and at the same time gives a lot of flexibility to run a subset of those valid combinations. i think this is a good strategy to follow.

for example, if you want to run only with non-bonded forces on the GPU (which implies only GPU partitions and CUDA modules), you can add:
if you want to test only the CUDA modules, you can add:
if you want to run only in non-GPU partitions and only with non-CUDA modules, you can add (this works when both 'cpu' and 'gpu' features are set on the GPU partitions):
of course, if you don't have the 'cpu' feature set on GPU partitions, the last example becomes simpler:
there are many other possibilities, such as selecting (or skipping) one or a set of toolchain generations, e.g.:

if you agree with this approach, i hope we can merge this soon. i will then work on moving as much logic as possible out of the gromacs test, for maximal reuse in other tests.
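to make this concrete, a hypothetical invocation that restricts the run to a single module and a single partition, using two of the variables added in this PR (the config/test file names, module version and system/partition names are made up for illustration):

```
reframe -C settings.py -c gromacs.py -r \
    --setvar modules=GROMACS/2021.3-foss-2021a \
    --setvar valid_systems=mysystem:gpu
```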
note that i chose a generic variable name for
Unless I'm missing something, any of those selections can already be made with the
I.e. all parametrisation information gets included in the name of the test, and can thus be filtered on with
and
result in the same set of tests:
Similarly, the equivalent of
would be
And the equivalent of
would be
Also, I tested
and this also works as expected. In fact, the documentation states this explicit case can also be achieved with the more readable
as ReFrame selects a test if it matches any of the

Admittedly, the syntax of doing things with
, but it does correctly select the test with

I'd personally not be in favour of introducing more variables just to achieve a slightly simpler selection syntax. Or am I overlooking something, and can you make selections with these variables that you can't do with
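To make the comparison concrete: ReFrame's -n / --name option matches a regular expression against a test's display name, which includes all parameter values, and the documentation also describes a %param=value shorthand for matching on a single parameter. A couple of hypothetical invocations (the module version and the parameter name nb_impl are made up for illustration; the actual parameter names in this test may differ):

```
# list only the checks whose display name mentions the CUDA module
reframe -C settings.py -c gromacs.py -l -n 'GROMACS.2021.3.*CUDA'

# list only the checks with a given parameter value, using the shorthand
reframe -C settings.py -c gromacs.py -l -n '%nb_impl=gpu'
```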
oh, i didn't know this, thanks for pointing it out! so the full display name in your example is this, right? i'll check if those options indeed cover all use cases that we care about, but they probably will, as all info is in the name.
with the latest update, the test:
note that you should no longer explicitly set

please test :)
the last commit changes the default behavior of num_cpus_per_task to be equal to the total number of cpus per node divided by the total number of gpus per node. for example, for a node with 48 cores and 4 gpus, you get 48 / 4 = 12 cpus per task.

i think this is a more sensible default: if you use only part of the GPUs you will probably also use only part of the CPUs.
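to illustrate the idea (not the PR's exact code), a minimal sketch of how such a default could be derived in a ReFrame hook, using the partition's processor and device info; the class and hook names are assumptions:

```python
import reframe as rfm
import reframe.core.builtins as builtins


class GROMACSLikeTest(rfm.RunOnlyRegressionTest):
    # ... parameters, executable, sanity checks, etc. omitted ...

    @builtins.run_after('setup')
    def set_default_num_cpus_per_task(self):
        '''Divide the cores of a node evenly over its GPUs by default.'''
        proc = self.current_partition.processor
        gpus_per_node = sum(dev.num_devices
                            for dev in self.current_partition.devices
                            if dev.device_type == 'gpu')
        if self.num_cpus_per_task is None and gpus_per_node > 0:
            # e.g. 48 cores and 4 GPUs per node -> 12 CPUs per task
            self.num_cpus_per_task = proc.num_cpus // gpus_per_node
```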
Ok, I tested a number of combinations. For context, our GPU nodes have 4 GPUs per node, and 72 CPU cores per node.
All good so far. But now, what about CPU tests:
I haven't had the time yet to figure out where this is set exactly, and why it's not sensible. Note that leaving everything default
does work fine. I'm guessing this was not the result of your latest commits - it probably was like that ever since we added the ability to override these options. But maybe we should just fix that right away as well...? Anyway, nice work on the overrides for the GPU counts :)
indeed, my original idea was that num_cpus_per_task and num_tasks_per_node should both be set at the same time, but we can do better and make sure that by default the total number of cores per node is always requested (unless both are set, of course). the last change should fix this.
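continuing the sketch from above (same assumed class and imports, not the PR's actual code), the fix could look roughly like this:

```python
    @builtins.run_after('setup')
    def set_tasks_and_cpus(self):
        '''If only one of num_tasks_per_node / num_cpus_per_task is given,
        derive the other one so that a full node is still requested.'''
        cores_per_node = self.current_partition.processor.num_cpus
        if self.num_tasks_per_node and not self.num_cpus_per_task:
            self.num_cpus_per_task = cores_per_node // self.num_tasks_per_node
        elif self.num_cpus_per_task and not self.num_tasks_per_node:
            self.num_tasks_per_node = cores_per_node // self.num_cpus_per_task
```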
Looks good to me. Nice milestone @smoors, finally a merge of this PR! :)
FYI: I tested all the above combinations again, and all produce the expected and desired results now :)
Joining a bit late to this party. Sorry for that. I have been testing the recent merge and am not exactly getting the same results as both of you are.
with
@satishskamath currently, if you specify

the reason is that unfortunately, combining

now that i think about it, maybe it's possible to check the features of the partitions that are specified on the cmd line, and filter the tests based on that? something to try and/or discuss..
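as a rough sketch of that idea (only an idea, not code from this PR; the hook name, the handling of valid_systems and the skip message are all assumptions), something like this might work:

```python
import reframe as rfm
import reframe.core.builtins as builtins
import reframe.core.runtime as rt


class GROMACSLikeTest(rfm.RunOnlyRegressionTest):
    # ... rest of the test omitted ...

    @builtins.run_after('init')
    def skip_if_no_gpu_partition_selected(self):
        '''Skip GPU variants when none of the selected partitions has the
        'gpu' feature.'''
        partitions = rt.runtime().system.partitions
        selected = [p for p in partitions
                    if '*' in self.valid_systems or p.fullname in self.valid_systems]
        self.skip_if(not any('gpu' in p.features for p in selected),
                     'no GPU partition among the selected valid_systems')
```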
i created an issue for this: #21
use system features support in ReFrame (#10)

changes:
- add features 'cpu' and/or 'gpu' to the partitions in the site config file
- variables that can be set on the cmd line:
  - --setvar modules=<modulename>
  - --setvar valid_systems=<comma-separated-list>
  - --setvar num_tasks_per_node=<x>
  - --setvar num_cpus_per_task=<y>
  - --setvar num_gpus_per_node=<z>
  - --setvar env_vars=<envar>:<value>
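for example, a hypothetical run that combines several of these overrides (file names, system name and values are illustrative only):

```
reframe -C settings.py -c gromacs.py -r \
    --setvar valid_systems=mysystem:gpu \
    --setvar num_gpus_per_node=1 \
    --setvar num_tasks_per_node=1 \
    --setvar num_cpus_per_task=12
```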