{2023.06} foss/2022b #309

casparvl · 2023-07-18T21:26:10Z

No description provided.

eessi-bot · 2023-07-18T21:26:13Z

Instance eessi-bot-citc-aws is configured to build:

arch x86_64/generic for repo eessi-2021.12
arch x86_64/generic for repo eessi-2023.06-compat
arch x86_64/generic for repo eessi-2023.06-software
arch x86_64/intel/haswell for repo eessi-2021.12
arch x86_64/intel/haswell for repo eessi-2023.06-compat
arch x86_64/intel/haswell for repo eessi-2023.06-software
arch x86_64/intel/skylake_avx512 for repo eessi-2021.12
arch x86_64/intel/skylake_avx512 for repo eessi-2023.06-compat
arch x86_64/intel/skylake_avx512 for repo eessi-2023.06-software
arch x86_64/amd/zen2 for repo eessi-2021.12
arch x86_64/amd/zen2 for repo eessi-2023.06-compat
arch x86_64/amd/zen2 for repo eessi-2023.06-software
arch x86_64/amd/zen3 for repo eessi-2021.12
arch x86_64/amd/zen3 for repo eessi-2023.06-compat
arch x86_64/amd/zen3 for repo eessi-2023.06-software
arch aarch64/generic for repo eessi-2021.12
arch aarch64/generic for repo eessi-2023.06-compat
arch aarch64/generic for repo eessi-2023.06-software
arch aarch64/neoverse_n1 for repo eessi-2021.12
arch aarch64/neoverse_n1 for repo eessi-2023.06-compat
arch aarch64/neoverse_n1 for repo eessi-2023.06-software
arch aarch64/neoverse_v1 for repo eessi-2021.12
arch aarch64/neoverse_v1 for repo eessi-2023.06-compat
arch aarch64/neoverse_v1 for repo eessi-2023.06-software

casparvl · 2023-07-18T21:27:31Z

bot: build repo:eessi-2023.06-software arch:x86_64/generic

eessi-bot · 2023-07-18T21:27:33Z

Updates by the bot instance eessi-bot-citc-aws (click for details)

received bot command build repo:eessi-2023.06-software arch:x86_64/generic from casparvl
- expanded format: build repository:eessi-2023.06-software architecture:x86_64/generic

casparvl · 2023-07-18T21:50:59Z

bot: build repo:eessi-2023.06-software arch:x86_64/generic

eessi-bot · 2023-07-18T21:51:01Z

Updates by the bot instance eessi-bot-citc-aws (click for details)

received bot command build repo:eessi-2023.06-software arch:x86_64/generic from casparvl
- expanded format: build repository:eessi-2023.06-software architecture:x86_64/generic
handling command build repository:eessi-2023.06-software architecture:x86_64/generic resulted in:
- submitted job 5976, for details & status see {2023.06} foss/2022b #309 (comment)

eessi-bot · 2023-07-18T21:51:07Z

New job on instance eessi-bot-citc-aws for architecture x86_64-generic for repository eessi-2023.06-software in job dir /mnt/shared/home/bot/eessi-bot-software-layer/jobs/2023.07/pr_309/5976

date	job status	comment
Jul 18 21:51:06 UTC 2023	submitted	job id `5976` awaits release by job manager
Jul 18 21:52:10 UTC 2023	released	job awaits launch by Slurm scheduler
Jul 18 21:55:14 UTC 2023	running	job `5976` is running
Jul 18 23:00:52 UTC 2023	finished	😢 FAILURE (click triangle for details) Details ✅ job output file `slurm-5976.out` ❌ found message matching `ERROR:` ❌ found message matching `FAILED:` ❌ found message matching `required modules missing:` ✅ found message(s) matching `No missing installations` ✅ found message matching `.tar.gz created!` Artefacts `eessi-2023.06-software-linux-x86_64-generic-1689721175.tar.gz` size: 7 MiB (8361989 bytes) entries: 201 modules under 2023.06/software/linux/x86_64/generic/modules/all `FFTW/3.3.10-GCC-12.2.0.lua` `libarchive/3.6.1-GCCcore-12.2.0.lua` software under 2023.06/software/linux/x86_64/generic/software `FFTW/3.3.10-GCC-12.2.0` `libarchive/3.6.1-GCCcore-12.2.0` other under 2023.06/software/linux/x86_64/generic `.lmod/cache/spiderT.lua` `.lmod/cache/spiderT.luac_5.1` `.lmod/cache/timestamp`

casparvl · 2023-07-18T23:38:16Z

bot: build repo:eessi-2023.06-software arch:x86_64/generic

eessi-bot · 2023-07-18T23:38:18Z

Updates by the bot instance eessi-bot-citc-aws (click for details)

received bot command build repo:eessi-2023.06-software arch:x86_64/generic from casparvl
- expanded format: build repository:eessi-2023.06-software architecture:x86_64/generic
handling command build repository:eessi-2023.06-software architecture:x86_64/generic resulted in:
- submitted job 5979, for details & status see {2023.06} foss/2022b #309 (comment)

eessi-bot · 2023-07-18T23:38:25Z

New job on instance eessi-bot-citc-aws for architecture x86_64-generic for repository eessi-2023.06-software in job dir /mnt/shared/home/bot/eessi-bot-software-layer/jobs/2023.07/pr_309/5979

date	job status	comment
Jul 18 23:38:24 UTC 2023	submitted	job id `5979` awaits release by job manager
Jul 18 23:39:02 UTC 2023	released	job awaits launch by Slurm scheduler
Jul 18 23:42:08 UTC 2023	running	job `5979` is running
Jul 19 02:05:06 UTC 2023	finished	😁 SUCCESS (click triangle for details) Details ✅ job output file `slurm-5979.out` ✅ no message matching `ERROR:` ✅ no message matching `FAILED:` ✅ no message matching `required modules missing:` ✅ found message(s) matching `No missing installations` ✅ found message matching `.tar.gz created!` Artefacts `eessi-2023.06-software-linux-x86_64-generic-1689732235.tar.gz` size: 155 MiB (162851275 bytes) entries: 14531 modules under 2023.06/software/linux/x86_64/generic/modules/all `BLIS/0.9.0-GCC-12.2.0.lua` `CMake/3.24.3-GCCcore-12.2.0.lua` `FFTW/3.3.10-GCC-12.2.0.lua` `FFTW.MPI/3.3.10-gompi-2022b.lua` `FlexiBLAS/3.2.1-GCC-12.2.0.lua` `foss/2022b.lua` `gompi/2022b.lua` `libarchive/3.6.1-GCCcore-12.2.0.lua` `libffi/3.4.4-GCCcore-12.2.0.lua` `make/4.3-GCCcore-12.2.0.lua` `OpenBLAS/0.3.21-GCC-12.2.0.lua` `Python/3.10.8-GCCcore-12.2.0-bare.lua` `ScaLAPACK/2.2.0-gompi-2022b-fb.lua` `SQLite/3.39.4-GCCcore-12.2.0.lua` `Tcl/8.6.12-GCCcore-12.2.0.lua` `UnZip/6.0-GCCcore-12.2.0.lua` software under 2023.06/software/linux/x86_64/generic/software `BLIS/0.9.0-GCC-12.2.0` `CMake/3.24.3-GCCcore-12.2.0` `FFTW/3.3.10-GCC-12.2.0` `FFTW.MPI/3.3.10-gompi-2022b` `FlexiBLAS/3.2.1-GCC-12.2.0` `foss/2022b` `gompi/2022b` `libarchive/3.6.1-GCCcore-12.2.0` `libffi/3.4.4-GCCcore-12.2.0` `make/4.3-GCCcore-12.2.0` `OpenBLAS/0.3.21-GCC-12.2.0` `Python/3.10.8-GCCcore-12.2.0-bare` `ScaLAPACK/2.2.0-gompi-2022b-fb` `SQLite/3.39.4-GCCcore-12.2.0` `Tcl/8.6.12-GCCcore-12.2.0` `UnZip/6.0-GCCcore-12.2.0` other under 2023.06/software/linux/x86_64/generic `.lmod/cache/spiderT.lua` `.lmod/cache/spiderT.luac_5.1` `.lmod/cache/timestamp`
Aug 09 12:29:32 UTC 2023	uploaded	transfer of `eessi-2023.06-software-linux-x86_64-generic-1689732235.tar.gz` to S3 bucket succeeded

casparvl · 2023-07-19T11:08:12Z

bot: build repo:eessi-2023.06-software arch:x86_64/intel/haswell
bot: build repo:eessi-2023.06-software arch:x86_64/intel/skylake_avx512
bot: build repo:eessi-2023.06-software arch:x86_64/amd/zen3
bot: build repo:eessi-2023.06-software arch:x86_64/amd/zen2
bot: build repo:eessi-2023.06-software arch:aarch64/generic
bot: build repo:eessi-2023.06-software arch:aarch64/neoverse_v1
bot: build repo:eessi-2023.06-software arch:aarch64/neoverse_n1

eessi-bot · 2023-07-19T11:08:15Z

Updates by the bot instance eessi-bot-citc-aws (click for details)

received bot command build repo:eessi-2023.06-software arch:x86_64/intel/haswell from casparvl
- expanded format: build repository:eessi-2023.06-software architecture:x86_64/intel/haswell
received bot command build repo:eessi-2023.06-software arch:x86_64/intel/skylake_avx512 from casparvl
- expanded format: build repository:eessi-2023.06-software architecture:x86_64/intel/skylake_avx512
received bot command build repo:eessi-2023.06-software arch:x86_64/amd/zen3 from casparvl
- expanded format: build repository:eessi-2023.06-software architecture:x86_64/amd/zen3
received bot command build repo:eessi-2023.06-software arch:x86_64/amd/zen2 from casparvl
- expanded format: build repository:eessi-2023.06-software architecture:x86_64/amd/zen2
received bot command build repo:eessi-2023.06-software arch:aarch64/generic from casparvl
- expanded format: build repository:eessi-2023.06-software architecture:aarch64/generic
received bot command build repo:eessi-2023.06-software arch:aarch64/neoverse_v1 from casparvl
- expanded format: build repository:eessi-2023.06-software architecture:aarch64/neoverse_v1
received bot command build repo:eessi-2023.06-software arch:aarch64/neoverse_n1 from casparvl
- expanded format: build repository:eessi-2023.06-software architecture:aarch64/neoverse_n1
handling command build repository:eessi-2023.06-software architecture:x86_64/intel/haswell resulted in:
- submitted job 5980, for details & status see {2023.06} foss/2022b #309 (comment)
handling command build repository:eessi-2023.06-software architecture:x86_64/intel/skylake_avx512 resulted in:
- submitted job 5981, for details & status see {2023.06} foss/2022b #309 (comment)
handling command build repository:eessi-2023.06-software architecture:x86_64/amd/zen3 resulted in:
- submitted job 5984, for details & status see {2023.06} foss/2022b #309 (comment)
handling command build repository:eessi-2023.06-software architecture:x86_64/amd/zen2 resulted in:
- submitted job 5985, for details & status see {2023.06} foss/2022b #309 (comment)
handling command build repository:eessi-2023.06-software architecture:aarch64/generic resulted in:
- submitted job 5987, for details & status see {2023.06} foss/2022b #309 (comment)
handling command build repository:eessi-2023.06-software architecture:aarch64/neoverse_v1 resulted in:
- submitted job 5990, for details & status see {2023.06} foss/2022b #309 (comment)
handling command build repository:eessi-2023.06-software architecture:aarch64/neoverse_n1 resulted in:
- submitted job 5991, for details & status see {2023.06} foss/2022b #309 (comment)

eessi-bot · 2023-07-19T11:08:22Z

New job on instance eessi-bot-citc-aws for architecture x86_64-intel-haswell for repository eessi-2023.06-software in job dir /mnt/shared/home/bot/eessi-bot-software-layer/jobs/2023.07/pr_309/5980

date	job status	comment
Jul 19 11:08:21 UTC 2023	submitted	job id `5980` awaits release by job manager
Jul 19 11:08:35 UTC 2023	released	job awaits launch by Slurm scheduler
Jul 19 11:12:10 UTC 2023	running	job `5980` is running
Jul 19 13:07:45 UTC 2023	finished	😁 SUCCESS (click triangle for details) Details ✅ job output file `slurm-5980.out` ✅ no message matching `ERROR:` ✅ no message matching `FAILED:` ✅ no message matching `required modules missing:` ✅ found message(s) matching `No missing installations` ✅ found message matching `.tar.gz created!` Artefacts `eessi-2023.06-software-linux-x86_64-intel-haswell-1689771986.tar.gz` size: 149 MiB (157196837 bytes) entries: 14531 modules under 2023.06/software/linux/x86_64/intel/haswell/modules/all `BLIS/0.9.0-GCC-12.2.0.lua` `CMake/3.24.3-GCCcore-12.2.0.lua` `FFTW/3.3.10-GCC-12.2.0.lua` `FFTW.MPI/3.3.10-gompi-2022b.lua` `FlexiBLAS/3.2.1-GCC-12.2.0.lua` `foss/2022b.lua` `gompi/2022b.lua` `libarchive/3.6.1-GCCcore-12.2.0.lua` `libffi/3.4.4-GCCcore-12.2.0.lua` `make/4.3-GCCcore-12.2.0.lua` `OpenBLAS/0.3.21-GCC-12.2.0.lua` `Python/3.10.8-GCCcore-12.2.0-bare.lua` `ScaLAPACK/2.2.0-gompi-2022b-fb.lua` `SQLite/3.39.4-GCCcore-12.2.0.lua` `Tcl/8.6.12-GCCcore-12.2.0.lua` `UnZip/6.0-GCCcore-12.2.0.lua` software under 2023.06/software/linux/x86_64/intel/haswell/software `BLIS/0.9.0-GCC-12.2.0` `CMake/3.24.3-GCCcore-12.2.0` `FFTW/3.3.10-GCC-12.2.0` `FFTW.MPI/3.3.10-gompi-2022b` `FlexiBLAS/3.2.1-GCC-12.2.0` `foss/2022b` `gompi/2022b` `libarchive/3.6.1-GCCcore-12.2.0` `libffi/3.4.4-GCCcore-12.2.0` `make/4.3-GCCcore-12.2.0` `OpenBLAS/0.3.21-GCC-12.2.0` `Python/3.10.8-GCCcore-12.2.0-bare` `ScaLAPACK/2.2.0-gompi-2022b-fb` `SQLite/3.39.4-GCCcore-12.2.0` `Tcl/8.6.12-GCCcore-12.2.0` `UnZip/6.0-GCCcore-12.2.0` other under 2023.06/software/linux/x86_64/intel/haswell `.lmod/cache/spiderT.lua` `.lmod/cache/spiderT.luac_5.1` `.lmod/cache/timestamp`
Aug 09 12:29:22 UTC 2023	uploaded	transfer of `eessi-2023.06-software-linux-x86_64-intel-haswell-1689771986.tar.gz` to S3 bucket succeeded

eessi-bot · 2023-07-19T11:08:29Z

New job on instance eessi-bot-citc-aws for architecture x86_64-intel-skylake_avx512 for repository eessi-2023.06-software in job dir /mnt/shared/home/bot/eessi-bot-software-layer/jobs/2023.07/pr_309/5981

date	job status	comment
Jul 19 11:08:28 UTC 2023	submitted	job id `5981` awaits release by job manager
Jul 19 11:08:33 UTC 2023	released	job awaits launch by Slurm scheduler
Jul 19 11:12:08 UTC 2023	running	job `5981` is running
Jul 19 12:45:17 UTC 2023	finished	😁 SUCCESS (click triangle for details) Details ✅ job output file `slurm-5981.out` ✅ no message matching `ERROR:` ✅ no message matching `FAILED:` ✅ no message matching `required modules missing:` ✅ found message(s) matching `No missing installations` ✅ found message matching `.tar.gz created!` Artefacts `eessi-2023.06-software-linux-x86_64-intel-skylake_avx512-1689770638.tar.gz` size: 149 MiB (157271616 bytes) entries: 14531 modules under 2023.06/software/linux/x86_64/intel/skylake_avx512/modules/all `BLIS/0.9.0-GCC-12.2.0.lua` `CMake/3.24.3-GCCcore-12.2.0.lua` `FFTW/3.3.10-GCC-12.2.0.lua` `FFTW.MPI/3.3.10-gompi-2022b.lua` `FlexiBLAS/3.2.1-GCC-12.2.0.lua` `foss/2022b.lua` `gompi/2022b.lua` `libarchive/3.6.1-GCCcore-12.2.0.lua` `libffi/3.4.4-GCCcore-12.2.0.lua` `make/4.3-GCCcore-12.2.0.lua` `OpenBLAS/0.3.21-GCC-12.2.0.lua` `Python/3.10.8-GCCcore-12.2.0-bare.lua` `ScaLAPACK/2.2.0-gompi-2022b-fb.lua` `SQLite/3.39.4-GCCcore-12.2.0.lua` `Tcl/8.6.12-GCCcore-12.2.0.lua` `UnZip/6.0-GCCcore-12.2.0.lua` software under 2023.06/software/linux/x86_64/intel/skylake_avx512/software `BLIS/0.9.0-GCC-12.2.0` `CMake/3.24.3-GCCcore-12.2.0` `FFTW/3.3.10-GCC-12.2.0` `FFTW.MPI/3.3.10-gompi-2022b` `FlexiBLAS/3.2.1-GCC-12.2.0` `foss/2022b` `gompi/2022b` `libarchive/3.6.1-GCCcore-12.2.0` `libffi/3.4.4-GCCcore-12.2.0` `make/4.3-GCCcore-12.2.0` `OpenBLAS/0.3.21-GCC-12.2.0` `Python/3.10.8-GCCcore-12.2.0-bare` `ScaLAPACK/2.2.0-gompi-2022b-fb` `SQLite/3.39.4-GCCcore-12.2.0` `Tcl/8.6.12-GCCcore-12.2.0` `UnZip/6.0-GCCcore-12.2.0` other under 2023.06/software/linux/x86_64/intel/skylake_avx512 `.lmod/cache/spiderT.lua` `.lmod/cache/spiderT.luac_5.1` `.lmod/cache/timestamp`
Aug 09 12:30:01 UTC 2023	uploaded	transfer of `eessi-2023.06-software-linux-x86_64-intel-skylake_avx512-1689770638.tar.gz` to S3 bucket succeeded

eessi-bot · 2023-07-19T11:08:36Z

New job on instance eessi-bot-citc-aws for architecture x86_64-amd-zen3 for repository eessi-2023.06-software in job dir /mnt/shared/home/bot/eessi-bot-software-layer/jobs/2023.07/pr_309/5984

date	job status	comment
Jul 19 11:08:36 UTC 2023	submitted	job id `5984` awaits release by job manager
Jul 19 11:10:01 UTC 2023	released	job awaits launch by Slurm scheduler
Jul 19 11:12:17 UTC 2023	running	job `5984` is running
Jul 19 12:24:20 UTC 2023	finished	😁 SUCCESS (click triangle for details) Details ✅ job output file `slurm-5984.out` ✅ no message matching `ERROR:` ✅ no message matching `FAILED:` ✅ no message matching `required modules missing:` ✅ found message(s) matching `No missing installations` ✅ found message matching `.tar.gz created!` Artefacts `eessi-2023.06-software-linux-x86_64-amd-zen3-1689769363.tar.gz` size: 149 MiB (156869857 bytes) entries: 14531 modules under 2023.06/software/linux/x86_64/amd/zen3/modules/all `BLIS/0.9.0-GCC-12.2.0.lua` `CMake/3.24.3-GCCcore-12.2.0.lua` `FFTW/3.3.10-GCC-12.2.0.lua` `FFTW.MPI/3.3.10-gompi-2022b.lua` `FlexiBLAS/3.2.1-GCC-12.2.0.lua` `foss/2022b.lua` `gompi/2022b.lua` `libarchive/3.6.1-GCCcore-12.2.0.lua` `libffi/3.4.4-GCCcore-12.2.0.lua` `make/4.3-GCCcore-12.2.0.lua` `OpenBLAS/0.3.21-GCC-12.2.0.lua` `Python/3.10.8-GCCcore-12.2.0-bare.lua` `ScaLAPACK/2.2.0-gompi-2022b-fb.lua` `SQLite/3.39.4-GCCcore-12.2.0.lua` `Tcl/8.6.12-GCCcore-12.2.0.lua` `UnZip/6.0-GCCcore-12.2.0.lua` software under 2023.06/software/linux/x86_64/amd/zen3/software `BLIS/0.9.0-GCC-12.2.0` `CMake/3.24.3-GCCcore-12.2.0` `FFTW/3.3.10-GCC-12.2.0` `FFTW.MPI/3.3.10-gompi-2022b` `FlexiBLAS/3.2.1-GCC-12.2.0` `foss/2022b` `gompi/2022b` `libarchive/3.6.1-GCCcore-12.2.0` `libffi/3.4.4-GCCcore-12.2.0` `make/4.3-GCCcore-12.2.0` `OpenBLAS/0.3.21-GCC-12.2.0` `Python/3.10.8-GCCcore-12.2.0-bare` `ScaLAPACK/2.2.0-gompi-2022b-fb` `SQLite/3.39.4-GCCcore-12.2.0` `Tcl/8.6.12-GCCcore-12.2.0` `UnZip/6.0-GCCcore-12.2.0` other under 2023.06/software/linux/x86_64/amd/zen3 `.lmod/cache/spiderT.lua` `.lmod/cache/spiderT.luac_5.1` `.lmod/cache/timestamp`
Aug 09 12:29:12 UTC 2023	uploaded	transfer of `eessi-2023.06-software-linux-x86_64-amd-zen3-1689769363.tar.gz` to S3 bucket succeeded

eessi-bot · 2023-07-19T11:08:45Z

New job on instance eessi-bot-citc-aws for architecture x86_64-amd-zen2 for repository eessi-2023.06-software in job dir /mnt/shared/home/bot/eessi-bot-software-layer/jobs/2023.07/pr_309/5985

date	job status	comment
Jul 19 11:08:44 UTC 2023	submitted	job id `5985` awaits release by job manager
Jul 19 11:09:58 UTC 2023	released	job awaits launch by Slurm scheduler
Jul 19 11:12:16 UTC 2023	running	job `5985` is running
Jul 19 12:59:28 UTC 2023	finished	😁 SUCCESS (click triangle for details) Details ✅ job output file `slurm-5985.out` ✅ no message matching `ERROR:` ✅ no message matching `FAILED:` ✅ no message matching `required modules missing:` ✅ found message(s) matching `No missing installations` ✅ found message matching `.tar.gz created!` Artefacts `eessi-2023.06-software-linux-x86_64-amd-zen2-1689771474.tar.gz` size: 149 MiB (156890490 bytes) entries: 14531 modules under 2023.06/software/linux/x86_64/amd/zen2/modules/all `BLIS/0.9.0-GCC-12.2.0.lua` `CMake/3.24.3-GCCcore-12.2.0.lua` `FFTW/3.3.10-GCC-12.2.0.lua` `FFTW.MPI/3.3.10-gompi-2022b.lua` `FlexiBLAS/3.2.1-GCC-12.2.0.lua` `foss/2022b.lua` `gompi/2022b.lua` `libarchive/3.6.1-GCCcore-12.2.0.lua` `libffi/3.4.4-GCCcore-12.2.0.lua` `make/4.3-GCCcore-12.2.0.lua` `OpenBLAS/0.3.21-GCC-12.2.0.lua` `Python/3.10.8-GCCcore-12.2.0-bare.lua` `ScaLAPACK/2.2.0-gompi-2022b-fb.lua` `SQLite/3.39.4-GCCcore-12.2.0.lua` `Tcl/8.6.12-GCCcore-12.2.0.lua` `UnZip/6.0-GCCcore-12.2.0.lua` software under 2023.06/software/linux/x86_64/amd/zen2/software `BLIS/0.9.0-GCC-12.2.0` `CMake/3.24.3-GCCcore-12.2.0` `FFTW/3.3.10-GCC-12.2.0` `FFTW.MPI/3.3.10-gompi-2022b` `FlexiBLAS/3.2.1-GCC-12.2.0` `foss/2022b` `gompi/2022b` `libarchive/3.6.1-GCCcore-12.2.0` `libffi/3.4.4-GCCcore-12.2.0` `make/4.3-GCCcore-12.2.0` `OpenBLAS/0.3.21-GCC-12.2.0` `Python/3.10.8-GCCcore-12.2.0-bare` `ScaLAPACK/2.2.0-gompi-2022b-fb` `SQLite/3.39.4-GCCcore-12.2.0` `Tcl/8.6.12-GCCcore-12.2.0` `UnZip/6.0-GCCcore-12.2.0` other under 2023.06/software/linux/x86_64/amd/zen2 `.lmod/cache/spiderT.lua` `.lmod/cache/spiderT.luac_5.1` `.lmod/cache/timestamp`
Aug 09 12:29:51 UTC 2023	uploaded	transfer of `eessi-2023.06-software-linux-x86_64-amd-zen2-1689771474.tar.gz` to S3 bucket succeeded

eessi-bot · 2023-07-19T11:08:52Z

New job on instance eessi-bot-citc-aws for architecture aarch64-generic for repository eessi-2023.06-software in job dir /mnt/shared/home/bot/eessi-bot-software-layer/jobs/2023.07/pr_309/5987

date	job status	comment
Jul 19 11:08:51 UTC 2023	submitted	job id `5987` awaits release by job manager
Jul 19 11:09:52 UTC 2023	released	job awaits launch by Slurm scheduler
Jul 19 11:13:35 UTC 2023	running	job `5987` is running
Jul 19 12:39:37 UTC 2023	finished	😁 SUCCESS (click triangle for details) Details ✅ job output file `slurm-5987.out` ✅ no message matching `ERROR:` ✅ no message matching `FAILED:` ✅ no message matching `required modules missing:` ✅ found message(s) matching `No missing installations` ✅ found message matching `.tar.gz created!` Artefacts `eessi-2023.06-software-linux-aarch64-generic-1689770284.tar.gz` size: 145 MiB (152356330 bytes) entries: 14510 modules under 2023.06/software/linux/aarch64/generic/modules/all `BLIS/0.9.0-GCC-12.2.0.lua` `CMake/3.24.3-GCCcore-12.2.0.lua` `FFTW/3.3.10-GCC-12.2.0.lua` `FFTW.MPI/3.3.10-gompi-2022b.lua` `FlexiBLAS/3.2.1-GCC-12.2.0.lua` `foss/2022b.lua` `gompi/2022b.lua` `libarchive/3.6.1-GCCcore-12.2.0.lua` `libffi/3.4.4-GCCcore-12.2.0.lua` `make/4.3-GCCcore-12.2.0.lua` `OpenBLAS/0.3.21-GCC-12.2.0.lua` `Python/3.10.8-GCCcore-12.2.0-bare.lua` `ScaLAPACK/2.2.0-gompi-2022b-fb.lua` `SQLite/3.39.4-GCCcore-12.2.0.lua` `Tcl/8.6.12-GCCcore-12.2.0.lua` `UnZip/6.0-GCCcore-12.2.0.lua` software under 2023.06/software/linux/aarch64/generic/software `BLIS/0.9.0-GCC-12.2.0` `CMake/3.24.3-GCCcore-12.2.0` `FFTW/3.3.10-GCC-12.2.0` `FFTW.MPI/3.3.10-gompi-2022b` `FlexiBLAS/3.2.1-GCC-12.2.0` `foss/2022b` `gompi/2022b` `libarchive/3.6.1-GCCcore-12.2.0` `libffi/3.4.4-GCCcore-12.2.0` `make/4.3-GCCcore-12.2.0` `OpenBLAS/0.3.21-GCC-12.2.0` `Python/3.10.8-GCCcore-12.2.0-bare` `ScaLAPACK/2.2.0-gompi-2022b-fb` `SQLite/3.39.4-GCCcore-12.2.0` `Tcl/8.6.12-GCCcore-12.2.0` `UnZip/6.0-GCCcore-12.2.0` other under 2023.06/software/linux/aarch64/generic `.lmod/cache/spiderT.lua` `.lmod/cache/spiderT.luac_5.1` `.lmod/cache/timestamp`
Aug 09 12:29:41 UTC 2023	uploaded	transfer of `eessi-2023.06-software-linux-aarch64-generic-1689770284.tar.gz` to S3 bucket succeeded

eessi-bot · 2023-07-19T11:08:59Z

New job on instance eessi-bot-citc-aws for architecture aarch64-neoverse_v1 for repository eessi-2023.06-software in job dir /mnt/shared/home/bot/eessi-bot-software-layer/jobs/2023.07/pr_309/5990

date	job status	comment
Jul 19 11:08:58 UTC 2023	submitted	job id `5990` awaits release by job manager
Jul 19 11:09:44 UTC 2023	released	job awaits launch by Slurm scheduler
Jul 19 11:13:30 UTC 2023	running	job `5990` is running
Jul 19 12:18:21 UTC 2023	finished	😢 FAILURE (click triangle for details) Details ✅ job output file `slurm-5990.out` ❌ found message matching `ERROR:` ❌ found message matching `FAILED:` ❌ found message matching `required modules missing:` ✅ found message(s) matching `No missing installations` ✅ found message matching `.tar.gz created!` Artefacts `eessi-2023.06-software-linux-aarch64-neoverse_v1-1689768996.tar.gz` size: 114 MiB (120126306 bytes) entries: 14321 modules under 2023.06/software/linux/aarch64/neoverse_v1/modules/all `BLIS/0.9.0-GCC-12.2.0.lua` `CMake/3.24.3-GCCcore-12.2.0.lua` `FFTW/3.3.10-GCC-12.2.0.lua` `FFTW.MPI/3.3.10-gompi-2022b.lua` `gompi/2022b.lua` `libarchive/3.6.1-GCCcore-12.2.0.lua` `libffi/3.4.4-GCCcore-12.2.0.lua` `make/4.3-GCCcore-12.2.0.lua` `Python/3.10.8-GCCcore-12.2.0-bare.lua` `SQLite/3.39.4-GCCcore-12.2.0.lua` `Tcl/8.6.12-GCCcore-12.2.0.lua` `UnZip/6.0-GCCcore-12.2.0.lua` software under 2023.06/software/linux/aarch64/neoverse_v1/software `BLIS/0.9.0-GCC-12.2.0` `CMake/3.24.3-GCCcore-12.2.0` `FFTW/3.3.10-GCC-12.2.0` `FFTW.MPI/3.3.10-gompi-2022b` `gompi/2022b` `libarchive/3.6.1-GCCcore-12.2.0` `libffi/3.4.4-GCCcore-12.2.0` `make/4.3-GCCcore-12.2.0` `Python/3.10.8-GCCcore-12.2.0-bare` `SQLite/3.39.4-GCCcore-12.2.0` `Tcl/8.6.12-GCCcore-12.2.0` `UnZip/6.0-GCCcore-12.2.0` other under 2023.06/software/linux/aarch64/neoverse_v1 `.lmod/cache/spiderT.lua` `.lmod/cache/spiderT.luac_5.1` `.lmod/cache/timestamp`

eessi-bot · 2023-07-19T11:09:06Z

New job on instance eessi-bot-citc-aws for architecture aarch64-neoverse_n1 for repository eessi-2023.06-software in job dir /mnt/shared/home/bot/eessi-bot-software-layer/jobs/2023.07/pr_309/5991

date	job status	comment
Jul 19 11:09:05 UTC 2023	submitted	job id `5991` awaits release by job manager
Jul 19 11:09:42 UTC 2023	released	job awaits launch by Slurm scheduler
Jul 19 11:13:28 UTC 2023	running	job `5991` is running
Jul 19 12:38:27 UTC 2023	finished	😁 SUCCESS (click triangle for details) Details ✅ job output file `slurm-5991.out` ✅ no message matching `ERROR:` ✅ no message matching `FAILED:` ✅ no message matching `required modules missing:` ✅ found message(s) matching `No missing installations` ✅ found message matching `.tar.gz created!` Artefacts `eessi-2023.06-software-linux-aarch64-neoverse_n1-1689770207.tar.gz` size: 138 MiB (144942690 bytes) entries: 14510 modules under 2023.06/software/linux/aarch64/neoverse_n1/modules/all `BLIS/0.9.0-GCC-12.2.0.lua` `CMake/3.24.3-GCCcore-12.2.0.lua` `FFTW/3.3.10-GCC-12.2.0.lua` `FFTW.MPI/3.3.10-gompi-2022b.lua` `FlexiBLAS/3.2.1-GCC-12.2.0.lua` `foss/2022b.lua` `gompi/2022b.lua` `libarchive/3.6.1-GCCcore-12.2.0.lua` `libffi/3.4.4-GCCcore-12.2.0.lua` `make/4.3-GCCcore-12.2.0.lua` `OpenBLAS/0.3.21-GCC-12.2.0.lua` `Python/3.10.8-GCCcore-12.2.0-bare.lua` `ScaLAPACK/2.2.0-gompi-2022b-fb.lua` `SQLite/3.39.4-GCCcore-12.2.0.lua` `Tcl/8.6.12-GCCcore-12.2.0.lua` `UnZip/6.0-GCCcore-12.2.0.lua` software under 2023.06/software/linux/aarch64/neoverse_n1/software `BLIS/0.9.0-GCC-12.2.0` `CMake/3.24.3-GCCcore-12.2.0` `FFTW/3.3.10-GCC-12.2.0` `FFTW.MPI/3.3.10-gompi-2022b` `FlexiBLAS/3.2.1-GCC-12.2.0` `foss/2022b` `gompi/2022b` `libarchive/3.6.1-GCCcore-12.2.0` `libffi/3.4.4-GCCcore-12.2.0` `make/4.3-GCCcore-12.2.0` `OpenBLAS/0.3.21-GCC-12.2.0` `Python/3.10.8-GCCcore-12.2.0-bare` `ScaLAPACK/2.2.0-gompi-2022b-fb` `SQLite/3.39.4-GCCcore-12.2.0` `Tcl/8.6.12-GCCcore-12.2.0` `UnZip/6.0-GCCcore-12.2.0` other under 2023.06/software/linux/aarch64/neoverse_n1 `.lmod/cache/spiderT.lua` `.lmod/cache/spiderT.luac_5.1` `.lmod/cache/timestamp`
Aug 09 12:29:03 UTC 2023	uploaded	transfer of `eessi-2023.06-software-linux-aarch64-neoverse_n1-1689770207.tar.gz` to S3 bucket succeeded

casparvl · 2023-07-19T12:50:55Z

The failure on neoverse_v1:

0:42:15  10 out of 14 easyconfigs done: BLIS/0.9.0-GCC-12.2.0 (OK), Python/3.10.ERROR: Build of /cvmfs/pilot.eessi-hpc.org/versions/2023.06/software/linux/aarch64/neoverse_v1/software/EasyBuild/4.7.2/easybuild/easyconfigs/o/OpenBLAS/OpenBLAS-0.3.21-GCC-12.2.0.eb failed (err: 'build failed (first 300 chars): Too many LAPACK tests failed due to numerical errors: 344 (> 300)')

bedroge · 2023-07-19T13:28:30Z

The failure on neoverse_v1:

0:42:15  10 out of 14 easyconfigs done: BLIS/0.9.0-GCC-12.2.0 (OK), Python/3.10.ERROR: Build of /cvmfs/pilot.eessi-hpc.org/versions/2023.06/software/linux/aarch64/neoverse_v1/software/EasyBuild/4.7.2/easybuild/easyconfigs/o/OpenBLAS/OpenBLAS-0.3.21-GCC-12.2.0.eb failed (err: 'build failed (first 300 chars): Too many LAPACK tests failed due to numerical errors: 344 (> 300)')

So apparently the increase to 300 here is not enough for this version. Do we increase it a bit more (don't know how much sense it makes to just ignore more and more tests?)?

casparvl · 2023-07-20T08:31:13Z

I don't know either. I guess we can, but it does make one wonder: if numerical results are different, how different are they, and is that still acceptable? We should at the very least check that all failures are numerical failures, I guess.

I think the more fundamental question is: what should are tests guarantee?

If they should only guarantee that the installation went ok, I think we can ignore the numerical errors: this is simply the behavior of this OpenBLAS version on neoverse_v1. "Fixing" that is not up to us.
If they should guarantee that software produces the right result, we should not deploy it on neoverse_v1, and only deploy a version there once these numerical inconsistencies have been resolved (which it might never be if the OpenBLAS devs don't consider it a prolbem).

It's probably good to at least report an issue upstream, see what they say. I'd assume the devs are more adept at judging whether these numerical inconsistencies should be considered problematic or not. One issue with taking the first approach is that issues may also pop up in other packages, such as #306

casparvl · 2023-07-20T08:41:41Z

More detailed error:

                        -->   LAPACK TESTING SUMMARY  <--
SUMMARY                 nb test run     numerical error         other error
================        ===========     =================       ================
REAL                    1315683         107     (0.008%)        0       (0.000%)
DOUBLE PRECISION        1314777         66      (0.005%)        0       (0.000%)
COMPLEX                 773609          97      (0.013%)        0       (0.000%)
COMPLEX16               776246          74      (0.010%)        0       (0.000%)

--> ALL PRECISIONS      4180315         344     (0.008%)        0       (0.000%)


== 2023-07-19 12:16:15,506 openblas.py:113 INFO 4180315 LAPACK tests run - 344 failed due to numerical errors - 0 failed due to other errors

So it is all numerical failures. That is a little bit encouraging...

casparvl · 2023-08-04T17:26:09Z

I managed to go into the singularity container by unpacking the tarball at /mnt/shared/home/bot/eessi-bot-software-layer/jobs/2023.07/pr_309/5990/previous_tmp

doing a singularity shell ghcr.io_eessi_build_node_debian11.sif

and find the detailed testing results at bot/easybuild/build/OpenBLAS/0.3.21/GCC-12.2.0/OpenBLAS-0.3.21/lapack-netlib/TESTING/testing_results.txt. An excerpt:

...
Matrix order=   15, type=10, seed= 798,1691,2423, 745, result  5 is 8.389E+06
 Matrix order=   15, type=11, seed= 931,1787, 557,2429, result  5 is 8.389E+06
 Matrix order=   15, type=17, seed= 529,3615,1764,1221, result  5 is 8.389E+06
 Matrix order=   15, type=18, seed=3991, 625,3539,2581, result  5 is 8.389E+06
 Matrix order=   15, type=19, seed=2400,1821, 218,1365, result  5 is 8.389E+06
 Matrix order=   15, type=20, seed=2155,2073,3686, 149, result  5 is 8.389E+06
 Matrix order=   15, type=21, seed=3823,2194,3510,3029, result  5 is 8.389E+06
 Matrix order=   15, type=22, seed=2951, 608,1256,3481, result  5 is 8.389E+06
 Matrix order=   15, type=23, seed=1243,2927, 263, 357, result  5 is 8.389E+06
 Matrix order=   15, type=24, seed=1487,3956,2976,3649, result  5 is 8.389E+06
 Matrix order=   15, type=25, seed=2605,2211,3982,2721, result  5 is 8.389E+06
 Matrix order=   15, type=26, seed=   8,1600,1726,2817, result  5 is 8.389E+06
 Matrix order=   20, type= 8, seed= 624, 693,1681,   9, result  5 is 8.389E+06
 Matrix order=   20, type=10, seed= 563,2705, 428, 633, result  5 is 8.389E+06
 Matrix order=   20, type=12, seed= 112,1059,2403,2121, result  5 is 8.389E+06
 Matrix order=   20, type=13, seed=3448,3524, 410,2697, result  5 is 8.389E+06
 Matrix order=   20, type=14, seed= 248,   6,3354,3273, result  5 is 8.389E+06
 Matrix order=   20, type=17, seed= 723,4024,2193,2265, result  5 is 8.389E+06
 Matrix order=   20, type=18, seed= 828, 842,1787,3733, result  5 is 8.389E+06
 Matrix order=   20, type=19, seed= 252,3327,3300,1389, result  5 is 8.389E+06
 Matrix order=   20, type=20, seed=1904,2346,3199,2437, result  5 is 8.389E+06
 Matrix order=   20, type=21, seed=4066,  26,1253, 221, result  5 is 8.389E+06
 Matrix order=   20, type=22, seed=3234,3664,3237,3473, result  5 is 8.389E+06
 Matrix order=   20, type=23, seed= 481, 957,3230,3689, result  5 is 8.389E+06
 Matrix order=   20, type=24, seed=2871,2618,1987,2949, result  5 is 8.389E+06
 Matrix order=   20, type=25, seed=2604,3233, 644,3529, result  5 is 8.389E+06
 Matrix order=   20, type=26, seed=1465,2266,3976,3833, result  5 is 8.389E+06
 SGV drivers:     64 out of   1092 tests failed to pass the threshold
...
 Matrix order=   20, type=10, seed= 563,2705, 428, 633, result  5 is 4.504D+15
 Matrix order=   20, type=12, seed= 112,1059,2403,2121, result  5 is 4.504D+15
 Matrix order=   20, type=13, seed=3448,3524, 410,2697, result  5 is 4.504D+15
 Matrix order=   20, type=17, seed= 723,4024,2193,2265, result  5 is 4.504D+15
 Matrix order=   20, type=18, seed= 828, 842,1787,3733, result  5 is 4.504D+15
 Matrix order=   20, type=19, seed= 252,3327,3300,1389, result  5 is 4.504D+15
 Matrix order=   20, type=20, seed=1904,2346,3199,2437, result  5 is 4.504D+15
 Matrix order=   20, type=21, seed=4066,  26,1253, 221, result  5 is 4.504D+15
 Matrix order=   20, type=22, seed=3234,3664,3237,3473, result  5 is 4.504D+15
 Matrix order=   20, type=23, seed= 481, 957,3230,3689, result  5 is 4.504D+15
 Matrix order=   20, type=24, seed=2871,2618,1987,2949, result  5 is 4.504D+15
 Matrix order=   20, type=25, seed=2604,3233, 644,3529, result  5 is 4.504D+15
 Matrix order=   20, type=26, seed=1465,2266,3976,3833, result  5 is 4.504D+15
 DGV drivers:     34 out of   1092 tests failed to pass the threshold
...
 Matrix order=    6, type=16, seed= 852,1170,4018,2529, result  5 is 4.504D+15
 Matrix order=    8, type=20, seed= 202,3818,2830,2373, result  5 is 4.504D+15
 Matrix order=    8, type=25, seed= 582,1680, 289,1745, result  5 is 4.504D+15
 Matrix order=   10, type= 8, seed= 573,2931,1057, 529, result  5 is 4.504D+15
 Matrix order=   10, type=10, seed= 308, 212,3333,  73, result  5 is 4.504D+15
 Matrix order=   10, type=13, seed=3036,2372,2403,2421, result  5 is 4.504D+15
 Matrix order=   15, type=12, seed=2288,1863,3505, 805, result  5 is 4.504D+15
 Matrix order=   15, type=13, seed=3415,2880,2965,2889, result  5 is 4.504D+15
 Matrix order=   15, type=16, seed= 681,2778,2914,2225, result  5 is 4.504D+15
 Matrix order=   15, type=17, seed= 304,3863, 350,1657, result  5 is 4.504D+15
 Matrix order=   15, type=21, seed=2362, 980, 534,3365, result  5 is 4.504D+15
 Matrix order=   15, type=25, seed=2888,1855, 900,2225, result  5 is 4.504D+15
 Matrix order=   20, type= 8, seed= 636, 119,  47,2269, result  5 is 4.504D+15
 Matrix order=   20, type= 9, seed=1007,2268,2913,  29, result  5 is 4.504D+15
 Matrix order=   20, type=10, seed= 104,1378,2159,1485, result  5 is 4.504D+15
 Matrix order=   20, type=11, seed= 927, 874,3720,2349, result  5 is 4.504D+15
 Matrix order=   20, type=14, seed=2659,2147, 552,1661, result  5 is 4.504D+15
 Matrix order=   20, type=21, seed=2012,3064,3728,2789, result  5 is 4.504D+15
 DGV drivers:     18 out of   1092 tests failed to pass the threshold
...
 N=  30 M=   0, P=   5, type  8, test  2, ratio=  555331.
 N=   3 M=   0, P=  20, type  8, test  2, ratio= 0.889025E+07
 N=  30 M=   0, P=  20, type  8, test  2, ratio= 0.219820E+07
 M=   3 P=   0, N=  30, type  7, test  1, ratio= 0.800042E+07
 N=  30 M=   3, P=   0, type  7, test  1, ratio= 0.772279E+07
 M=   3 P=   5, N=  30, type  7, test  1, ratio= 0.852669E+07
 N=  30 M=   3, P=   5, type  8, test  2, ratio= 0.119398E+07
 N=   3 M=   3, P=  20, type  8, test  2, ratio= 0.998240E+07
 N=  30 M=   3, P=  20, type  8, test  2, ratio=  976876.
 N=   3 M=  10, P=   5, type  8, test  2, ratio= 0.121475E+08
 N=  30 M=  10, P=   5, type  7, test  1, ratio= 0.127652E+08
 N=  30 M=  10, P=   5, type  8, test  2, ratio=  714258.
 M=  10 P=  20, N=  30, type  7, test  1, ratio= 0.510405E+07
 N=  30 M=  10, P=  20, type  8, test  2, ratio= 0.208493E+07
 GQR:     14 out of   1728 tests failed to pass the threshold
...
 Matrix order=   20, type=16, seed= 647,2328,1944, 557, result  5 is 8.389E+06
 CGV drivers:      1 out of   1092 tests failed to pass the threshold
...
 CHESV , UPLO='U', N =    3, type  2, test  1, ratio = 0.10874E+07
 CHESVX, FACT='N', UPLO='U', N =    3, type  2, test  1, ratio = 0.10874E+07
 CHESV , UPLO='U', N =    5, type  9, test  1, ratio = 0.32042E+07
 CHESVX, FACT='N', UPLO='U', N =    5, type  9, test  1, ratio = 0.32042E+07
 CHESV , UPLO='U', N =    5, type 10, test  1, ratio = 0.35394E+07
 CHESVX, FACT='N', UPLO='U', N =    5, type 10, test  1, ratio = 0.35394E+07
 CHESV , UPLO='U', N =   10, type  2, test  1, ratio = 0.16458E+07
 CHESVX, FACT='N', UPLO='U', N =   10, type  2, test  1, ratio = 0.16458E+07
 CHESV , UPLO='U', N =   10, type  7, test  1, ratio = 0.14266E+06
 CHESVX, FACT='N', UPLO='U', N =   10, type  7, test  1, ratio = 0.14266E+06
 CHESV , UPLO='U', N =   10, type  9, test  1, ratio = 0.31372E+07
 CHESVX, FACT='N', UPLO='U', N =   10, type  9, test  1, ratio = 0.31372E+07
 CHESV , UPLO='U', N =   10, type 10, test  1, ratio = 0.16814E+07
 CHESVX, FACT='N', UPLO='U', N =   10, type 10, test  1, ratio = 0.16814E+07
 CHESV , UPLO='U', N =   50, type  2, test  1, ratio = 0.86168E+07
 CHESVX, FACT='N', UPLO='U', N =   50, type  2, test  1, ratio = 0.86168E+07
 CHESV , UPLO='U', N =   50, type  7, test  1, ratio = 0.80869E+06
 CHESVX, FACT='N', UPLO='U', N =   50, type  7, test  1, ratio = 0.80869E+06
 CHESV , UPLO='U', N =   50, type  8, test  1, ratio = 0.52818E+06
 CHESVX, FACT='N', UPLO='U', N =   50, type  8, test  1, ratio = 0.52818E+06
 CHESV , UPLO='U', N =   50, type  9, test  1, ratio = 0.85655E+07
 CHESVX, FACT='N', UPLO='U', N =   50, type  9, test  1, ratio = 0.85655E+07
 CHESV , UPLO='U', N =   50, type 10, test  1, ratio = 0.32095E+07
 CHESVX, FACT='N', UPLO='U', N =   50, type 10, test  1, ratio = 0.32095E+07
 CHE drivers:     24 out of   1072 tests failed to pass the threshold
...
 Matrix order=   12, type=24, seed=2693,2404,3046,2957, result  5 is 4.504D+15
 Matrix order=   12, type=25, seed= 578, 723, 929,1637, result  5 is 4.504D+15
 Matrix order=   12, type=26, seed=2061,1512,1968, 125, result  5 is 4.504D+15
 Matrix order=   20, type=16, seed= 647,2328,1944, 557, result  5 is 4.504D+15
 Matrix order=   20, type=17, seed=1585,3902,3906,1293, result  5 is 4.504D+15
 Matrix order=   20, type=18, seed=1063,2113,3640,2685, result  5 is 4.504D+15
 Matrix order=   20, type=19, seed=2305,3879, 305, 381, result  5 is 4.504D+15
 Matrix order=   20, type=20, seed= 897,3616, 121,2173, result  5 is 4.504D+15
 Matrix order=   20, type=21, seed= 676,3851,3089,3965, result  5 is 4.504D+15
 Matrix order=   20, type=22, seed=1763,2855,1954,1469, result  5 is 4.504D+15
 Matrix order=   20, type=23, seed=1338,3437,3180,3285, result  5 is 4.504D+15
 Matrix order=   20, type=24, seed=3522,1685,3785, 813, result  5 is 4.504D+15
 Matrix order=   20, type=25, seed=2931,3978,3195,3781, result  5 is 4.504D+15
 Matrix order=   20, type=26, seed= 449,1038,1670,1437, result  5 is 4.504D+15
 ZGV drivers:     53 out of   1092 tests failed to pass the threshold

Now, I'm no expert in these tests, and have no idea what this means. It does not look like small numerical errors to my untrained eye, but I might be completely wrong...

boegel · 2023-08-04T19:15:09Z

@casparvl To avoid getting stuck on this even longer, I think we should:

Bump the limit for failing numerical tests for OpenBLAS to 350 or 400 in our hooks (see here), so the test step passes. The LAPACK test suite summary clearly show that just 0.008% of all (numerical) tests fail, I strongly feel that's not serious enough to block the installation, especially since this only happens for aarch64/neoverse_v1, and that we'll have a clear record of this in the EasyBuild log file that is included in the installation.
Open an issue in this repository to keep track of the fact that we're seeing a bit more numerical failures for OpenBLAS on aarch64 than we do on amd or intel systems, with all the details (like the stuff you mentioned in your last comment, since this PR is not the right place to have this discussion imho), and also compare things across easyconfig generations/toolchains.
Report this upstream to OpenBLAS, with sufficient detail, and keep track of the feedback in our own issue - we could probably use some help from @bartoldeman here if he can find the time for it, since he has some experience there.

We should take a similar approach to unblock other PRs (numpy in #306, FFTW in #297 and #310), and push on to discover other issues which no doubt will pop up.

We can't reasonable expect that we'll figure out each and every failing test for all software we'll install, especially because we known that some test suite are quite buggy themselves (PyTorch comes to mind, but this also applies to LAPACK, see @bartoldeman's PR which dealt with a non-numerical failure in the LAPACK test suite, see also issue #18017).

This procedure should probably be documented as well, and even become part of the contribution policy, with some rules of thumb on when this approach is acceptable (how many failing tests we see, on how many CPU targets, etc.).

…M. See EESSI#309

casparvl · 2023-08-08T10:50:16Z

bot: build repo:eessi-2023.06-software arch:aarch64/neoverse_v1

eessi-bot · 2023-08-08T10:50:19Z

Updates by the bot instance eessi-bot-citc-aws (click for details)

received bot command build repo:eessi-2023.06-software arch:aarch64/neoverse_v1 from casparvl
- expanded format: build repository:eessi-2023.06-software architecture:aarch64/neoverse_v1
handling command build repository:eessi-2023.06-software architecture:aarch64/neoverse_v1 resulted in:
- submitted job 6354, for details & status see {2023.06} foss/2022b #309 (comment)

eessi-bot · 2023-08-08T10:50:26Z

New job on instance eessi-bot-citc-aws for architecture aarch64-neoverse_v1 for repository eessi-2023.06-software in job dir /mnt/shared/home/bot/eessi-bot-software-layer/jobs/2023.08/pr_309/6354

date	job status	comment
Aug 08 10:50:25 UTC 2023	submitted	job id `6354` awaits release by job manager
Aug 08 10:50:32 UTC 2023	released	job awaits launch by Slurm scheduler
Aug 08 10:54:34 UTC 2023	running	job `6354` is running
Aug 08 12:07:49 UTC 2023	finished	😁 SUCCESS (click triangle for details) Details ✅ job output file `slurm-6354.out` ✅ no message matching `ERROR:` ✅ no message matching `FAILED:` ✅ no message matching `required modules missing:` ✅ found message(s) matching `No missing installations` ✅ found message matching `.tar.gz created!` Artefacts `eessi-2023.06-software-linux-aarch64-neoverse_v1-1691496382.tar.gz` size: 136 MiB (143344312 bytes) entries: 14510 modules under 2023.06/software/linux/aarch64/neoverse_v1/modules/all `BLIS/0.9.0-GCC-12.2.0.lua` `CMake/3.24.3-GCCcore-12.2.0.lua` `FFTW/3.3.10-GCC-12.2.0.lua` `FFTW.MPI/3.3.10-gompi-2022b.lua` `FlexiBLAS/3.2.1-GCC-12.2.0.lua` `foss/2022b.lua` `gompi/2022b.lua` `libarchive/3.6.1-GCCcore-12.2.0.lua` `libffi/3.4.4-GCCcore-12.2.0.lua` `make/4.3-GCCcore-12.2.0.lua` `OpenBLAS/0.3.21-GCC-12.2.0.lua` `Python/3.10.8-GCCcore-12.2.0-bare.lua` `ScaLAPACK/2.2.0-gompi-2022b-fb.lua` `SQLite/3.39.4-GCCcore-12.2.0.lua` `Tcl/8.6.12-GCCcore-12.2.0.lua` `UnZip/6.0-GCCcore-12.2.0.lua` software under 2023.06/software/linux/aarch64/neoverse_v1/software `BLIS/0.9.0-GCC-12.2.0` `CMake/3.24.3-GCCcore-12.2.0` `FFTW/3.3.10-GCC-12.2.0` `FFTW.MPI/3.3.10-gompi-2022b` `FlexiBLAS/3.2.1-GCC-12.2.0` `foss/2022b` `gompi/2022b` `libarchive/3.6.1-GCCcore-12.2.0` `libffi/3.4.4-GCCcore-12.2.0` `make/4.3-GCCcore-12.2.0` `OpenBLAS/0.3.21-GCC-12.2.0` `Python/3.10.8-GCCcore-12.2.0-bare` `ScaLAPACK/2.2.0-gompi-2022b-fb` `SQLite/3.39.4-GCCcore-12.2.0` `Tcl/8.6.12-GCCcore-12.2.0` `UnZip/6.0-GCCcore-12.2.0` other under 2023.06/software/linux/aarch64/neoverse_v1 `.lmod/cache/spiderT.lua` `.lmod/cache/spiderT.luac_5.1` `.lmod/cache/timestamp`
Aug 09 12:30:11 UTC 2023	uploaded	transfer of `eessi-2023.06-software-linux-aarch64-neoverse_v1-1691496382.tar.gz` to S3 bucket succeeded

casparvl · 2023-08-08T10:57:28Z

Created issue for the OpenBLAS test failures:

#314

eb_hooks.py

remove Lmod cache update

Added foss-2022b

c2160ec

casparvl changed the title ~~Added foss-2022b~~ {2023.06}[foss-2022b] Added foss-2022b Jul 18, 2023

Merge branch '2023.06' into foss_2022b

4d9c476

casparvl added the pilot-2023.06 label Jul 18, 2023

Added cmake explicitely, since it needs an include-easyblocks-from-pr

45df462

Increased number of tests that are allowed to fail for OpenBLAS on AR…

c23fd93

…M. See EESSI#309

casparvl mentioned this pull request Aug 8, 2023

OpenBLAS test suite failures on ARM neoverse_v1 #314

Open

boegel reviewed Aug 8, 2023

View reviewed changes

eb_hooks.py Show resolved Hide resolved

boegel changed the title ~~{2023.06}[foss-2022b] Added foss-2022b~~ {2023.06} foss/2022b Aug 8, 2023

Added reference to ticket for increase in accepted test failures

36f5b30

casparvl mentioned this pull request Aug 8, 2023

{2023.06} foss/2022a #310

Merged

boegel added the bot:deploy Ask bot to deploy missing software installations to EESSI label Aug 9, 2023

boegel approved these changes Aug 9, 2023

View reviewed changes

boegel merged commit b694709 into EESSI:2023.06 Aug 9, 2023

trz42 pushed a commit to trz42/software-layer that referenced this pull request Apr 7, 2024

Merge pull request EESSI#309 from trz42/nessi_remove_lmod_cache_update

17a6e90

remove Lmod cache update

casparvl deleted the foss_2022b branch August 15, 2024 15:03

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

{2023.06} foss/2022b #309

{2023.06} foss/2022b #309

casparvl commented Jul 18, 2023

eessi-bot bot commented Jul 18, 2023

casparvl commented Jul 18, 2023

eessi-bot bot commented Jul 18, 2023

casparvl commented Jul 18, 2023

eessi-bot bot commented Jul 18, 2023 •

edited

Loading

eessi-bot bot commented Jul 18, 2023 •

edited

Loading

casparvl commented Jul 18, 2023

eessi-bot bot commented Jul 18, 2023 •

edited

Loading

eessi-bot bot commented Jul 18, 2023 •

edited

Loading

casparvl commented Jul 19, 2023

eessi-bot bot commented Jul 19, 2023 •

edited

Loading

eessi-bot bot commented Jul 19, 2023 •

edited

Loading

eessi-bot bot commented Jul 19, 2023 •

edited

Loading

eessi-bot bot commented Jul 19, 2023 •

edited

Loading

eessi-bot bot commented Jul 19, 2023 •

edited

Loading

eessi-bot bot commented Jul 19, 2023 •

edited

Loading

eessi-bot bot commented Jul 19, 2023 •

edited

Loading

eessi-bot bot commented Jul 19, 2023 •

edited

Loading

casparvl commented Jul 19, 2023

bedroge commented Jul 19, 2023

casparvl commented Jul 20, 2023

casparvl commented Jul 20, 2023 •

edited

Loading

casparvl commented Aug 4, 2023

boegel commented Aug 4, 2023

casparvl commented Aug 8, 2023

eessi-bot bot commented Aug 8, 2023 •

edited

Loading

eessi-bot bot commented Aug 8, 2023 •

edited

Loading

casparvl commented Aug 8, 2023

{2023.06} foss/2022b #309

{2023.06} foss/2022b #309

Conversation

casparvl commented Jul 18, 2023

eessi-bot bot commented Jul 18, 2023

casparvl commented Jul 18, 2023

eessi-bot bot commented Jul 18, 2023

casparvl commented Jul 18, 2023

eessi-bot bot commented Jul 18, 2023 • edited Loading

eessi-bot bot commented Jul 18, 2023 • edited Loading

casparvl commented Jul 18, 2023

eessi-bot bot commented Jul 18, 2023 • edited Loading

eessi-bot bot commented Jul 18, 2023 • edited Loading

casparvl commented Jul 19, 2023

eessi-bot bot commented Jul 19, 2023 • edited Loading

eessi-bot bot commented Jul 19, 2023 • edited Loading

eessi-bot bot commented Jul 19, 2023 • edited Loading

eessi-bot bot commented Jul 19, 2023 • edited Loading

eessi-bot bot commented Jul 19, 2023 • edited Loading

eessi-bot bot commented Jul 19, 2023 • edited Loading

eessi-bot bot commented Jul 19, 2023 • edited Loading

eessi-bot bot commented Jul 19, 2023 • edited Loading

casparvl commented Jul 19, 2023

bedroge commented Jul 19, 2023

casparvl commented Jul 20, 2023

casparvl commented Jul 20, 2023 • edited Loading

casparvl commented Aug 4, 2023

boegel commented Aug 4, 2023

casparvl commented Aug 8, 2023

eessi-bot bot commented Aug 8, 2023 • edited Loading

eessi-bot bot commented Aug 8, 2023 • edited Loading

casparvl commented Aug 8, 2023

eessi-bot bot commented Jul 18, 2023 •

edited

Loading

eessi-bot bot commented Jul 18, 2023 •

edited

Loading

eessi-bot bot commented Jul 18, 2023 •

edited

Loading

eessi-bot bot commented Jul 18, 2023 •

edited

Loading

eessi-bot bot commented Jul 19, 2023 •

edited

Loading

eessi-bot bot commented Jul 19, 2023 •

edited

Loading

eessi-bot bot commented Jul 19, 2023 •

edited

Loading

eessi-bot bot commented Jul 19, 2023 •

edited

Loading

eessi-bot bot commented Jul 19, 2023 •

edited

Loading

eessi-bot bot commented Jul 19, 2023 •

edited

Loading

eessi-bot bot commented Jul 19, 2023 •

edited

Loading

eessi-bot bot commented Jul 19, 2023 •

edited

Loading

casparvl commented Jul 20, 2023 •

edited

Loading

eessi-bot bot commented Aug 8, 2023 •

edited

Loading

eessi-bot bot commented Aug 8, 2023 •

edited

Loading