Sync meeting on EESSI test suite (2023 04 20)
Caspar van Leeuwen edited this page Apr 20, 2023 · 1 revision
- every 2 weeks on Thursday at 14:00 CE(S)T
- next meetings:
- Thu 30 March 14:00 => OK
- Thu 20 April 14:00 => OK
- Wed 3 May 14:00 => reschedule? (now clashes with monthly EasyBuild 5.0 sync meeting)
- Wed 17 May 14:00 => OK
- Wed 31 May 14:00 => OK
- https://github.com/EESSI/meetings/wiki/Sync-meeting-on-EESSI-test-suite-(2023-03-30)
- https://github.com/EESSI/meetings/wiki/Sync-meeting-on-EESSI-test-suite-(2023-03-10) (incl. 2023-02-23)
- https://github.com/EESSI/meetings/wiki/Sync-meeting-on-EESSI-test-suite-(2023-02-09)
-
OSU tests
- Based on the internal OSU tests from SURF. Question: should it be based on the HPCtestlib OSU test instead?
- Satish will have a good look at the OSU test from HPCtestlib and see if we can use it (or components from it)
- Status: works for CPU pt2pt, not for GPU yet.
-
TensorFlow test
- Python script is there and works on GPU; still to be tested whether it shows the right behavior on CPU
- Still need to make it into a ReFrame test
- Uses mpi4py to determine the local rank of each process, bind the process to a single GPU, etc. (because tf.distribute has no awareness of MPI)
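
For illustration, the rank-to-GPU binding described above could look roughly like the sketch below. This is not the script discussed in the meeting: the function names (`pick_gpu_index`, `get_local_rank`, `bind_process_to_gpu`) and the round-robin mapping are assumptions made for this example. It assumes mpi4py and TensorFlow are available and that the script is started through an MPI launcher.

```python
"""Sketch: bind each MPI process to one GPU before TensorFlow initializes."""


def pick_gpu_index(local_rank: int, num_gpus: int) -> int:
    """Map a node-local rank to a GPU index (round-robin if ranks > GPUs)."""
    if num_gpus <= 0:
        raise ValueError("no GPUs available on this node")
    return local_rank % num_gpus


def get_local_rank() -> int:
    """Return this process's rank among the processes on the same node."""
    # Imported lazily so pick_gpu_index() can be used without MPI installed.
    from mpi4py import MPI

    # Split COMM_WORLD into one communicator per shared-memory domain (node).
    local_comm = MPI.COMM_WORLD.Split_type(MPI.COMM_TYPE_SHARED)
    return local_comm.Get_rank()


def bind_process_to_gpu() -> None:
    """Make only one GPU visible to TensorFlow in this process."""
    import tensorflow as tf

    gpus = tf.config.list_physical_devices("GPU")
    if not gpus:
        return  # CPU-only node: nothing to bind
    gpu = gpus[pick_gpu_index(get_local_rank(), len(gpus))]
    # Has to run before TensorFlow initializes its devices.
    tf.config.set_visible_devices([gpu], "GPU")
```

Calling `bind_process_to_gpu()` at the very top of the training script, before any other TensorFlow operation, would leave each process seeing at most one GPU. On a CPU-only node the call does nothing, which may be relevant for the CPU check that is still open.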
-
PR follow-up
- https://github.com/EESSI/test-suite/pull/26 Caspar needs to review again to check the changes by Sam
- https://github.com/EESSI/test-suite/pull/28 Needs #26 before it can really be tested, but it can already be reviewed visually
- https://github.com/EESSI/test-suite/pull/23 Merged, implements adding additional executable options
-
Next steps
- Start thinking about portable performance references / checks https://github.com/EESSI/test-suite/issues/31