Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

{2023.06} foss/2022b #309

Merged
merged 5 commits into from
Aug 9, 2023
Merged

{2023.06} foss/2022b #309

merged 5 commits into from
Aug 9, 2023

Conversation

casparvl
Copy link
Collaborator

No description provided.

@eessi-bot
Copy link

eessi-bot bot commented Jul 18, 2023

Instance eessi-bot-citc-aws is configured to build:

  • arch x86_64/generic for repo eessi-2021.12
  • arch x86_64/generic for repo eessi-2023.06-compat
  • arch x86_64/generic for repo eessi-2023.06-software
  • arch x86_64/intel/haswell for repo eessi-2021.12
  • arch x86_64/intel/haswell for repo eessi-2023.06-compat
  • arch x86_64/intel/haswell for repo eessi-2023.06-software
  • arch x86_64/intel/skylake_avx512 for repo eessi-2021.12
  • arch x86_64/intel/skylake_avx512 for repo eessi-2023.06-compat
  • arch x86_64/intel/skylake_avx512 for repo eessi-2023.06-software
  • arch x86_64/amd/zen2 for repo eessi-2021.12
  • arch x86_64/amd/zen2 for repo eessi-2023.06-compat
  • arch x86_64/amd/zen2 for repo eessi-2023.06-software
  • arch x86_64/amd/zen3 for repo eessi-2021.12
  • arch x86_64/amd/zen3 for repo eessi-2023.06-compat
  • arch x86_64/amd/zen3 for repo eessi-2023.06-software
  • arch aarch64/generic for repo eessi-2021.12
  • arch aarch64/generic for repo eessi-2023.06-compat
  • arch aarch64/generic for repo eessi-2023.06-software
  • arch aarch64/neoverse_n1 for repo eessi-2021.12
  • arch aarch64/neoverse_n1 for repo eessi-2023.06-compat
  • arch aarch64/neoverse_n1 for repo eessi-2023.06-software
  • arch aarch64/neoverse_v1 for repo eessi-2021.12
  • arch aarch64/neoverse_v1 for repo eessi-2023.06-compat
  • arch aarch64/neoverse_v1 for repo eessi-2023.06-software

@casparvl casparvl changed the title Added foss-2022b {2023.06}[foss-2022b] Added foss-2022b Jul 18, 2023
@casparvl
Copy link
Collaborator Author

bot: build repo:eessi-2023.06-software arch:x86_64/generic

@eessi-bot
Copy link

eessi-bot bot commented Jul 18, 2023

Updates by the bot instance eessi-bot-citc-aws (click for details)
  • received bot command build repo:eessi-2023.06-software arch:x86_64/generic from casparvl
    • expanded format: build repository:eessi-2023.06-software architecture:x86_64/generic

@casparvl
Copy link
Collaborator Author

bot: build repo:eessi-2023.06-software arch:x86_64/generic

@eessi-bot
Copy link

eessi-bot bot commented Jul 18, 2023

Updates by the bot instance eessi-bot-citc-aws (click for details)
  • received bot command build repo:eessi-2023.06-software arch:x86_64/generic from casparvl

    • expanded format: build repository:eessi-2023.06-software architecture:x86_64/generic
  • handling command build repository:eessi-2023.06-software architecture:x86_64/generic resulted in:

@eessi-bot
Copy link

eessi-bot bot commented Jul 18, 2023

New job on instance eessi-bot-citc-aws for architecture x86_64-generic for repository eessi-2023.06-software in job dir /mnt/shared/home/bot/eessi-bot-software-layer/jobs/2023.07/pr_309/5976

date job status comment
Jul 18 21:51:06 UTC 2023 submitted job id 5976 awaits release by job manager
Jul 18 21:52:10 UTC 2023 released job awaits launch by Slurm scheduler
Jul 18 21:55:14 UTC 2023 running job 5976 is running
Jul 18 23:00:52 UTC 2023 finished
😢 FAILURE (click triangle for details)
Details
✅ job output file slurm-5976.out
❌ found message matching ERROR:
❌ found message matching FAILED:
❌ found message matching required modules missing:
✅ found message(s) matching No missing installations
✅ found message matching .tar.gz created!
Artefacts
eessi-2023.06-software-linux-x86_64-generic-1689721175.tar.gzsize: 7 MiB (8361989 bytes)
entries: 201
modules under 2023.06/software/linux/x86_64/generic/modules/all
FFTW/3.3.10-GCC-12.2.0.lua
libarchive/3.6.1-GCCcore-12.2.0.lua
software under 2023.06/software/linux/x86_64/generic/software
FFTW/3.3.10-GCC-12.2.0
libarchive/3.6.1-GCCcore-12.2.0
other under 2023.06/software/linux/x86_64/generic
.lmod/cache/spiderT.lua
.lmod/cache/spiderT.luac_5.1
.lmod/cache/timestamp

@casparvl
Copy link
Collaborator Author

bot: build repo:eessi-2023.06-software arch:x86_64/generic

@eessi-bot
Copy link

eessi-bot bot commented Jul 18, 2023

Updates by the bot instance eessi-bot-citc-aws (click for details)
  • received bot command build repo:eessi-2023.06-software arch:x86_64/generic from casparvl

    • expanded format: build repository:eessi-2023.06-software architecture:x86_64/generic
  • handling command build repository:eessi-2023.06-software architecture:x86_64/generic resulted in:

@eessi-bot
Copy link

eessi-bot bot commented Jul 18, 2023

New job on instance eessi-bot-citc-aws for architecture x86_64-generic for repository eessi-2023.06-software in job dir /mnt/shared/home/bot/eessi-bot-software-layer/jobs/2023.07/pr_309/5979

date job status comment
Jul 18 23:38:24 UTC 2023 submitted job id 5979 awaits release by job manager
Jul 18 23:39:02 UTC 2023 released job awaits launch by Slurm scheduler
Jul 18 23:42:08 UTC 2023 running job 5979 is running
Jul 19 02:05:06 UTC 2023 finished
😁 SUCCESS (click triangle for details)
Details
✅ job output file slurm-5979.out
✅ no message matching ERROR:
✅ no message matching FAILED:
✅ no message matching required modules missing:
✅ found message(s) matching No missing installations
✅ found message matching .tar.gz created!
Artefacts
eessi-2023.06-software-linux-x86_64-generic-1689732235.tar.gzsize: 155 MiB (162851275 bytes)
entries: 14531
modules under 2023.06/software/linux/x86_64/generic/modules/all
BLIS/0.9.0-GCC-12.2.0.lua
CMake/3.24.3-GCCcore-12.2.0.lua
FFTW/3.3.10-GCC-12.2.0.lua
FFTW.MPI/3.3.10-gompi-2022b.lua
FlexiBLAS/3.2.1-GCC-12.2.0.lua
foss/2022b.lua
gompi/2022b.lua
libarchive/3.6.1-GCCcore-12.2.0.lua
libffi/3.4.4-GCCcore-12.2.0.lua
make/4.3-GCCcore-12.2.0.lua
OpenBLAS/0.3.21-GCC-12.2.0.lua
Python/3.10.8-GCCcore-12.2.0-bare.lua
ScaLAPACK/2.2.0-gompi-2022b-fb.lua
SQLite/3.39.4-GCCcore-12.2.0.lua
Tcl/8.6.12-GCCcore-12.2.0.lua
UnZip/6.0-GCCcore-12.2.0.lua
software under 2023.06/software/linux/x86_64/generic/software
BLIS/0.9.0-GCC-12.2.0
CMake/3.24.3-GCCcore-12.2.0
FFTW/3.3.10-GCC-12.2.0
FFTW.MPI/3.3.10-gompi-2022b
FlexiBLAS/3.2.1-GCC-12.2.0
foss/2022b
gompi/2022b
libarchive/3.6.1-GCCcore-12.2.0
libffi/3.4.4-GCCcore-12.2.0
make/4.3-GCCcore-12.2.0
OpenBLAS/0.3.21-GCC-12.2.0
Python/3.10.8-GCCcore-12.2.0-bare
ScaLAPACK/2.2.0-gompi-2022b-fb
SQLite/3.39.4-GCCcore-12.2.0
Tcl/8.6.12-GCCcore-12.2.0
UnZip/6.0-GCCcore-12.2.0
other under 2023.06/software/linux/x86_64/generic
.lmod/cache/spiderT.lua
.lmod/cache/spiderT.luac_5.1
.lmod/cache/timestamp
Aug 09 12:29:32 UTC 2023 uploaded transfer of eessi-2023.06-software-linux-x86_64-generic-1689732235.tar.gz to S3 bucket succeeded

@casparvl
Copy link
Collaborator Author

bot: build repo:eessi-2023.06-software arch:x86_64/intel/haswell
bot: build repo:eessi-2023.06-software arch:x86_64/intel/skylake_avx512
bot: build repo:eessi-2023.06-software arch:x86_64/amd/zen3
bot: build repo:eessi-2023.06-software arch:x86_64/amd/zen2
bot: build repo:eessi-2023.06-software arch:aarch64/generic
bot: build repo:eessi-2023.06-software arch:aarch64/neoverse_v1
bot: build repo:eessi-2023.06-software arch:aarch64/neoverse_n1

@eessi-bot
Copy link

eessi-bot bot commented Jul 19, 2023

Updates by the bot instance eessi-bot-citc-aws (click for details)
  • received bot command build repo:eessi-2023.06-software arch:x86_64/intel/haswell from casparvl

    • expanded format: build repository:eessi-2023.06-software architecture:x86_64/intel/haswell
  • received bot command build repo:eessi-2023.06-software arch:x86_64/intel/skylake_avx512 from casparvl

    • expanded format: build repository:eessi-2023.06-software architecture:x86_64/intel/skylake_avx512
  • received bot command build repo:eessi-2023.06-software arch:x86_64/amd/zen3 from casparvl

    • expanded format: build repository:eessi-2023.06-software architecture:x86_64/amd/zen3
  • received bot command build repo:eessi-2023.06-software arch:x86_64/amd/zen2 from casparvl

    • expanded format: build repository:eessi-2023.06-software architecture:x86_64/amd/zen2
  • received bot command build repo:eessi-2023.06-software arch:aarch64/generic from casparvl

    • expanded format: build repository:eessi-2023.06-software architecture:aarch64/generic
  • received bot command build repo:eessi-2023.06-software arch:aarch64/neoverse_v1 from casparvl

    • expanded format: build repository:eessi-2023.06-software architecture:aarch64/neoverse_v1
  • received bot command build repo:eessi-2023.06-software arch:aarch64/neoverse_n1 from casparvl

    • expanded format: build repository:eessi-2023.06-software architecture:aarch64/neoverse_n1
  • handling command build repository:eessi-2023.06-software architecture:x86_64/intel/haswell resulted in:

  • handling command build repository:eessi-2023.06-software architecture:x86_64/intel/skylake_avx512 resulted in:

  • handling command build repository:eessi-2023.06-software architecture:x86_64/amd/zen3 resulted in:

  • handling command build repository:eessi-2023.06-software architecture:x86_64/amd/zen2 resulted in:

  • handling command build repository:eessi-2023.06-software architecture:aarch64/generic resulted in:

  • handling command build repository:eessi-2023.06-software architecture:aarch64/neoverse_v1 resulted in:

  • handling command build repository:eessi-2023.06-software architecture:aarch64/neoverse_n1 resulted in:

@eessi-bot
Copy link

eessi-bot bot commented Jul 19, 2023

New job on instance eessi-bot-citc-aws for architecture x86_64-intel-haswell for repository eessi-2023.06-software in job dir /mnt/shared/home/bot/eessi-bot-software-layer/jobs/2023.07/pr_309/5980

date job status comment
Jul 19 11:08:21 UTC 2023 submitted job id 5980 awaits release by job manager
Jul 19 11:08:35 UTC 2023 released job awaits launch by Slurm scheduler
Jul 19 11:12:10 UTC 2023 running job 5980 is running
Jul 19 13:07:45 UTC 2023 finished
😁 SUCCESS (click triangle for details)
Details
✅ job output file slurm-5980.out
✅ no message matching ERROR:
✅ no message matching FAILED:
✅ no message matching required modules missing:
✅ found message(s) matching No missing installations
✅ found message matching .tar.gz created!
Artefacts
eessi-2023.06-software-linux-x86_64-intel-haswell-1689771986.tar.gzsize: 149 MiB (157196837 bytes)
entries: 14531
modules under 2023.06/software/linux/x86_64/intel/haswell/modules/all
BLIS/0.9.0-GCC-12.2.0.lua
CMake/3.24.3-GCCcore-12.2.0.lua
FFTW/3.3.10-GCC-12.2.0.lua
FFTW.MPI/3.3.10-gompi-2022b.lua
FlexiBLAS/3.2.1-GCC-12.2.0.lua
foss/2022b.lua
gompi/2022b.lua
libarchive/3.6.1-GCCcore-12.2.0.lua
libffi/3.4.4-GCCcore-12.2.0.lua
make/4.3-GCCcore-12.2.0.lua
OpenBLAS/0.3.21-GCC-12.2.0.lua
Python/3.10.8-GCCcore-12.2.0-bare.lua
ScaLAPACK/2.2.0-gompi-2022b-fb.lua
SQLite/3.39.4-GCCcore-12.2.0.lua
Tcl/8.6.12-GCCcore-12.2.0.lua
UnZip/6.0-GCCcore-12.2.0.lua
software under 2023.06/software/linux/x86_64/intel/haswell/software
BLIS/0.9.0-GCC-12.2.0
CMake/3.24.3-GCCcore-12.2.0
FFTW/3.3.10-GCC-12.2.0
FFTW.MPI/3.3.10-gompi-2022b
FlexiBLAS/3.2.1-GCC-12.2.0
foss/2022b
gompi/2022b
libarchive/3.6.1-GCCcore-12.2.0
libffi/3.4.4-GCCcore-12.2.0
make/4.3-GCCcore-12.2.0
OpenBLAS/0.3.21-GCC-12.2.0
Python/3.10.8-GCCcore-12.2.0-bare
ScaLAPACK/2.2.0-gompi-2022b-fb
SQLite/3.39.4-GCCcore-12.2.0
Tcl/8.6.12-GCCcore-12.2.0
UnZip/6.0-GCCcore-12.2.0
other under 2023.06/software/linux/x86_64/intel/haswell
.lmod/cache/spiderT.lua
.lmod/cache/spiderT.luac_5.1
.lmod/cache/timestamp
Aug 09 12:29:22 UTC 2023 uploaded transfer of eessi-2023.06-software-linux-x86_64-intel-haswell-1689771986.tar.gz to S3 bucket succeeded

@eessi-bot
Copy link

eessi-bot bot commented Jul 19, 2023

New job on instance eessi-bot-citc-aws for architecture x86_64-intel-skylake_avx512 for repository eessi-2023.06-software in job dir /mnt/shared/home/bot/eessi-bot-software-layer/jobs/2023.07/pr_309/5981

date job status comment
Jul 19 11:08:28 UTC 2023 submitted job id 5981 awaits release by job manager
Jul 19 11:08:33 UTC 2023 released job awaits launch by Slurm scheduler
Jul 19 11:12:08 UTC 2023 running job 5981 is running
Jul 19 12:45:17 UTC 2023 finished
😁 SUCCESS (click triangle for details)
Details
✅ job output file slurm-5981.out
✅ no message matching ERROR:
✅ no message matching FAILED:
✅ no message matching required modules missing:
✅ found message(s) matching No missing installations
✅ found message matching .tar.gz created!
Artefacts
eessi-2023.06-software-linux-x86_64-intel-skylake_avx512-1689770638.tar.gzsize: 149 MiB (157271616 bytes)
entries: 14531
modules under 2023.06/software/linux/x86_64/intel/skylake_avx512/modules/all
BLIS/0.9.0-GCC-12.2.0.lua
CMake/3.24.3-GCCcore-12.2.0.lua
FFTW/3.3.10-GCC-12.2.0.lua
FFTW.MPI/3.3.10-gompi-2022b.lua
FlexiBLAS/3.2.1-GCC-12.2.0.lua
foss/2022b.lua
gompi/2022b.lua
libarchive/3.6.1-GCCcore-12.2.0.lua
libffi/3.4.4-GCCcore-12.2.0.lua
make/4.3-GCCcore-12.2.0.lua
OpenBLAS/0.3.21-GCC-12.2.0.lua
Python/3.10.8-GCCcore-12.2.0-bare.lua
ScaLAPACK/2.2.0-gompi-2022b-fb.lua
SQLite/3.39.4-GCCcore-12.2.0.lua
Tcl/8.6.12-GCCcore-12.2.0.lua
UnZip/6.0-GCCcore-12.2.0.lua
software under 2023.06/software/linux/x86_64/intel/skylake_avx512/software
BLIS/0.9.0-GCC-12.2.0
CMake/3.24.3-GCCcore-12.2.0
FFTW/3.3.10-GCC-12.2.0
FFTW.MPI/3.3.10-gompi-2022b
FlexiBLAS/3.2.1-GCC-12.2.0
foss/2022b
gompi/2022b
libarchive/3.6.1-GCCcore-12.2.0
libffi/3.4.4-GCCcore-12.2.0
make/4.3-GCCcore-12.2.0
OpenBLAS/0.3.21-GCC-12.2.0
Python/3.10.8-GCCcore-12.2.0-bare
ScaLAPACK/2.2.0-gompi-2022b-fb
SQLite/3.39.4-GCCcore-12.2.0
Tcl/8.6.12-GCCcore-12.2.0
UnZip/6.0-GCCcore-12.2.0
other under 2023.06/software/linux/x86_64/intel/skylake_avx512
.lmod/cache/spiderT.lua
.lmod/cache/spiderT.luac_5.1
.lmod/cache/timestamp
Aug 09 12:30:01 UTC 2023 uploaded transfer of eessi-2023.06-software-linux-x86_64-intel-skylake_avx512-1689770638.tar.gz to S3 bucket succeeded

@eessi-bot
Copy link

eessi-bot bot commented Jul 19, 2023

New job on instance eessi-bot-citc-aws for architecture x86_64-amd-zen3 for repository eessi-2023.06-software in job dir /mnt/shared/home/bot/eessi-bot-software-layer/jobs/2023.07/pr_309/5984

date job status comment
Jul 19 11:08:36 UTC 2023 submitted job id 5984 awaits release by job manager
Jul 19 11:10:01 UTC 2023 released job awaits launch by Slurm scheduler
Jul 19 11:12:17 UTC 2023 running job 5984 is running
Jul 19 12:24:20 UTC 2023 finished
😁 SUCCESS (click triangle for details)
Details
✅ job output file slurm-5984.out
✅ no message matching ERROR:
✅ no message matching FAILED:
✅ no message matching required modules missing:
✅ found message(s) matching No missing installations
✅ found message matching .tar.gz created!
Artefacts
eessi-2023.06-software-linux-x86_64-amd-zen3-1689769363.tar.gzsize: 149 MiB (156869857 bytes)
entries: 14531
modules under 2023.06/software/linux/x86_64/amd/zen3/modules/all
BLIS/0.9.0-GCC-12.2.0.lua
CMake/3.24.3-GCCcore-12.2.0.lua
FFTW/3.3.10-GCC-12.2.0.lua
FFTW.MPI/3.3.10-gompi-2022b.lua
FlexiBLAS/3.2.1-GCC-12.2.0.lua
foss/2022b.lua
gompi/2022b.lua
libarchive/3.6.1-GCCcore-12.2.0.lua
libffi/3.4.4-GCCcore-12.2.0.lua
make/4.3-GCCcore-12.2.0.lua
OpenBLAS/0.3.21-GCC-12.2.0.lua
Python/3.10.8-GCCcore-12.2.0-bare.lua
ScaLAPACK/2.2.0-gompi-2022b-fb.lua
SQLite/3.39.4-GCCcore-12.2.0.lua
Tcl/8.6.12-GCCcore-12.2.0.lua
UnZip/6.0-GCCcore-12.2.0.lua
software under 2023.06/software/linux/x86_64/amd/zen3/software
BLIS/0.9.0-GCC-12.2.0
CMake/3.24.3-GCCcore-12.2.0
FFTW/3.3.10-GCC-12.2.0
FFTW.MPI/3.3.10-gompi-2022b
FlexiBLAS/3.2.1-GCC-12.2.0
foss/2022b
gompi/2022b
libarchive/3.6.1-GCCcore-12.2.0
libffi/3.4.4-GCCcore-12.2.0
make/4.3-GCCcore-12.2.0
OpenBLAS/0.3.21-GCC-12.2.0
Python/3.10.8-GCCcore-12.2.0-bare
ScaLAPACK/2.2.0-gompi-2022b-fb
SQLite/3.39.4-GCCcore-12.2.0
Tcl/8.6.12-GCCcore-12.2.0
UnZip/6.0-GCCcore-12.2.0
other under 2023.06/software/linux/x86_64/amd/zen3
.lmod/cache/spiderT.lua
.lmod/cache/spiderT.luac_5.1
.lmod/cache/timestamp
Aug 09 12:29:12 UTC 2023 uploaded transfer of eessi-2023.06-software-linux-x86_64-amd-zen3-1689769363.tar.gz to S3 bucket succeeded

@eessi-bot
Copy link

eessi-bot bot commented Jul 19, 2023

New job on instance eessi-bot-citc-aws for architecture x86_64-amd-zen2 for repository eessi-2023.06-software in job dir /mnt/shared/home/bot/eessi-bot-software-layer/jobs/2023.07/pr_309/5985

date job status comment
Jul 19 11:08:44 UTC 2023 submitted job id 5985 awaits release by job manager
Jul 19 11:09:58 UTC 2023 released job awaits launch by Slurm scheduler
Jul 19 11:12:16 UTC 2023 running job 5985 is running
Jul 19 12:59:28 UTC 2023 finished
😁 SUCCESS (click triangle for details)
Details
✅ job output file slurm-5985.out
✅ no message matching ERROR:
✅ no message matching FAILED:
✅ no message matching required modules missing:
✅ found message(s) matching No missing installations
✅ found message matching .tar.gz created!
Artefacts
eessi-2023.06-software-linux-x86_64-amd-zen2-1689771474.tar.gzsize: 149 MiB (156890490 bytes)
entries: 14531
modules under 2023.06/software/linux/x86_64/amd/zen2/modules/all
BLIS/0.9.0-GCC-12.2.0.lua
CMake/3.24.3-GCCcore-12.2.0.lua
FFTW/3.3.10-GCC-12.2.0.lua
FFTW.MPI/3.3.10-gompi-2022b.lua
FlexiBLAS/3.2.1-GCC-12.2.0.lua
foss/2022b.lua
gompi/2022b.lua
libarchive/3.6.1-GCCcore-12.2.0.lua
libffi/3.4.4-GCCcore-12.2.0.lua
make/4.3-GCCcore-12.2.0.lua
OpenBLAS/0.3.21-GCC-12.2.0.lua
Python/3.10.8-GCCcore-12.2.0-bare.lua
ScaLAPACK/2.2.0-gompi-2022b-fb.lua
SQLite/3.39.4-GCCcore-12.2.0.lua
Tcl/8.6.12-GCCcore-12.2.0.lua
UnZip/6.0-GCCcore-12.2.0.lua
software under 2023.06/software/linux/x86_64/amd/zen2/software
BLIS/0.9.0-GCC-12.2.0
CMake/3.24.3-GCCcore-12.2.0
FFTW/3.3.10-GCC-12.2.0
FFTW.MPI/3.3.10-gompi-2022b
FlexiBLAS/3.2.1-GCC-12.2.0
foss/2022b
gompi/2022b
libarchive/3.6.1-GCCcore-12.2.0
libffi/3.4.4-GCCcore-12.2.0
make/4.3-GCCcore-12.2.0
OpenBLAS/0.3.21-GCC-12.2.0
Python/3.10.8-GCCcore-12.2.0-bare
ScaLAPACK/2.2.0-gompi-2022b-fb
SQLite/3.39.4-GCCcore-12.2.0
Tcl/8.6.12-GCCcore-12.2.0
UnZip/6.0-GCCcore-12.2.0
other under 2023.06/software/linux/x86_64/amd/zen2
.lmod/cache/spiderT.lua
.lmod/cache/spiderT.luac_5.1
.lmod/cache/timestamp
Aug 09 12:29:51 UTC 2023 uploaded transfer of eessi-2023.06-software-linux-x86_64-amd-zen2-1689771474.tar.gz to S3 bucket succeeded

@eessi-bot
Copy link

eessi-bot bot commented Jul 19, 2023

New job on instance eessi-bot-citc-aws for architecture aarch64-generic for repository eessi-2023.06-software in job dir /mnt/shared/home/bot/eessi-bot-software-layer/jobs/2023.07/pr_309/5987

date job status comment
Jul 19 11:08:51 UTC 2023 submitted job id 5987 awaits release by job manager
Jul 19 11:09:52 UTC 2023 released job awaits launch by Slurm scheduler
Jul 19 11:13:35 UTC 2023 running job 5987 is running
Jul 19 12:39:37 UTC 2023 finished
😁 SUCCESS (click triangle for details)
Details
✅ job output file slurm-5987.out
✅ no message matching ERROR:
✅ no message matching FAILED:
✅ no message matching required modules missing:
✅ found message(s) matching No missing installations
✅ found message matching .tar.gz created!
Artefacts
eessi-2023.06-software-linux-aarch64-generic-1689770284.tar.gzsize: 145 MiB (152356330 bytes)
entries: 14510
modules under 2023.06/software/linux/aarch64/generic/modules/all
BLIS/0.9.0-GCC-12.2.0.lua
CMake/3.24.3-GCCcore-12.2.0.lua
FFTW/3.3.10-GCC-12.2.0.lua
FFTW.MPI/3.3.10-gompi-2022b.lua
FlexiBLAS/3.2.1-GCC-12.2.0.lua
foss/2022b.lua
gompi/2022b.lua
libarchive/3.6.1-GCCcore-12.2.0.lua
libffi/3.4.4-GCCcore-12.2.0.lua
make/4.3-GCCcore-12.2.0.lua
OpenBLAS/0.3.21-GCC-12.2.0.lua
Python/3.10.8-GCCcore-12.2.0-bare.lua
ScaLAPACK/2.2.0-gompi-2022b-fb.lua
SQLite/3.39.4-GCCcore-12.2.0.lua
Tcl/8.6.12-GCCcore-12.2.0.lua
UnZip/6.0-GCCcore-12.2.0.lua
software under 2023.06/software/linux/aarch64/generic/software
BLIS/0.9.0-GCC-12.2.0
CMake/3.24.3-GCCcore-12.2.0
FFTW/3.3.10-GCC-12.2.0
FFTW.MPI/3.3.10-gompi-2022b
FlexiBLAS/3.2.1-GCC-12.2.0
foss/2022b
gompi/2022b
libarchive/3.6.1-GCCcore-12.2.0
libffi/3.4.4-GCCcore-12.2.0
make/4.3-GCCcore-12.2.0
OpenBLAS/0.3.21-GCC-12.2.0
Python/3.10.8-GCCcore-12.2.0-bare
ScaLAPACK/2.2.0-gompi-2022b-fb
SQLite/3.39.4-GCCcore-12.2.0
Tcl/8.6.12-GCCcore-12.2.0
UnZip/6.0-GCCcore-12.2.0
other under 2023.06/software/linux/aarch64/generic
.lmod/cache/spiderT.lua
.lmod/cache/spiderT.luac_5.1
.lmod/cache/timestamp
Aug 09 12:29:41 UTC 2023 uploaded transfer of eessi-2023.06-software-linux-aarch64-generic-1689770284.tar.gz to S3 bucket succeeded

@eessi-bot
Copy link

eessi-bot bot commented Jul 19, 2023

New job on instance eessi-bot-citc-aws for architecture aarch64-neoverse_v1 for repository eessi-2023.06-software in job dir /mnt/shared/home/bot/eessi-bot-software-layer/jobs/2023.07/pr_309/5990

date job status comment
Jul 19 11:08:58 UTC 2023 submitted job id 5990 awaits release by job manager
Jul 19 11:09:44 UTC 2023 released job awaits launch by Slurm scheduler
Jul 19 11:13:30 UTC 2023 running job 5990 is running
Jul 19 12:18:21 UTC 2023 finished
😢 FAILURE (click triangle for details)
Details
✅ job output file slurm-5990.out
❌ found message matching ERROR:
❌ found message matching FAILED:
❌ found message matching required modules missing:
✅ found message(s) matching No missing installations
✅ found message matching .tar.gz created!
Artefacts
eessi-2023.06-software-linux-aarch64-neoverse_v1-1689768996.tar.gzsize: 114 MiB (120126306 bytes)
entries: 14321
modules under 2023.06/software/linux/aarch64/neoverse_v1/modules/all
BLIS/0.9.0-GCC-12.2.0.lua
CMake/3.24.3-GCCcore-12.2.0.lua
FFTW/3.3.10-GCC-12.2.0.lua
FFTW.MPI/3.3.10-gompi-2022b.lua
gompi/2022b.lua
libarchive/3.6.1-GCCcore-12.2.0.lua
libffi/3.4.4-GCCcore-12.2.0.lua
make/4.3-GCCcore-12.2.0.lua
Python/3.10.8-GCCcore-12.2.0-bare.lua
SQLite/3.39.4-GCCcore-12.2.0.lua
Tcl/8.6.12-GCCcore-12.2.0.lua
UnZip/6.0-GCCcore-12.2.0.lua
software under 2023.06/software/linux/aarch64/neoverse_v1/software
BLIS/0.9.0-GCC-12.2.0
CMake/3.24.3-GCCcore-12.2.0
FFTW/3.3.10-GCC-12.2.0
FFTW.MPI/3.3.10-gompi-2022b
gompi/2022b
libarchive/3.6.1-GCCcore-12.2.0
libffi/3.4.4-GCCcore-12.2.0
make/4.3-GCCcore-12.2.0
Python/3.10.8-GCCcore-12.2.0-bare
SQLite/3.39.4-GCCcore-12.2.0
Tcl/8.6.12-GCCcore-12.2.0
UnZip/6.0-GCCcore-12.2.0
other under 2023.06/software/linux/aarch64/neoverse_v1
.lmod/cache/spiderT.lua
.lmod/cache/spiderT.luac_5.1
.lmod/cache/timestamp

@eessi-bot
Copy link

eessi-bot bot commented Jul 19, 2023

New job on instance eessi-bot-citc-aws for architecture aarch64-neoverse_n1 for repository eessi-2023.06-software in job dir /mnt/shared/home/bot/eessi-bot-software-layer/jobs/2023.07/pr_309/5991

date job status comment
Jul 19 11:09:05 UTC 2023 submitted job id 5991 awaits release by job manager
Jul 19 11:09:42 UTC 2023 released job awaits launch by Slurm scheduler
Jul 19 11:13:28 UTC 2023 running job 5991 is running
Jul 19 12:38:27 UTC 2023 finished
😁 SUCCESS (click triangle for details)
Details
✅ job output file slurm-5991.out
✅ no message matching ERROR:
✅ no message matching FAILED:
✅ no message matching required modules missing:
✅ found message(s) matching No missing installations
✅ found message matching .tar.gz created!
Artefacts
eessi-2023.06-software-linux-aarch64-neoverse_n1-1689770207.tar.gzsize: 138 MiB (144942690 bytes)
entries: 14510
modules under 2023.06/software/linux/aarch64/neoverse_n1/modules/all
BLIS/0.9.0-GCC-12.2.0.lua
CMake/3.24.3-GCCcore-12.2.0.lua
FFTW/3.3.10-GCC-12.2.0.lua
FFTW.MPI/3.3.10-gompi-2022b.lua
FlexiBLAS/3.2.1-GCC-12.2.0.lua
foss/2022b.lua
gompi/2022b.lua
libarchive/3.6.1-GCCcore-12.2.0.lua
libffi/3.4.4-GCCcore-12.2.0.lua
make/4.3-GCCcore-12.2.0.lua
OpenBLAS/0.3.21-GCC-12.2.0.lua
Python/3.10.8-GCCcore-12.2.0-bare.lua
ScaLAPACK/2.2.0-gompi-2022b-fb.lua
SQLite/3.39.4-GCCcore-12.2.0.lua
Tcl/8.6.12-GCCcore-12.2.0.lua
UnZip/6.0-GCCcore-12.2.0.lua
software under 2023.06/software/linux/aarch64/neoverse_n1/software
BLIS/0.9.0-GCC-12.2.0
CMake/3.24.3-GCCcore-12.2.0
FFTW/3.3.10-GCC-12.2.0
FFTW.MPI/3.3.10-gompi-2022b
FlexiBLAS/3.2.1-GCC-12.2.0
foss/2022b
gompi/2022b
libarchive/3.6.1-GCCcore-12.2.0
libffi/3.4.4-GCCcore-12.2.0
make/4.3-GCCcore-12.2.0
OpenBLAS/0.3.21-GCC-12.2.0
Python/3.10.8-GCCcore-12.2.0-bare
ScaLAPACK/2.2.0-gompi-2022b-fb
SQLite/3.39.4-GCCcore-12.2.0
Tcl/8.6.12-GCCcore-12.2.0
UnZip/6.0-GCCcore-12.2.0
other under 2023.06/software/linux/aarch64/neoverse_n1
.lmod/cache/spiderT.lua
.lmod/cache/spiderT.luac_5.1
.lmod/cache/timestamp
Aug 09 12:29:03 UTC 2023 uploaded transfer of eessi-2023.06-software-linux-aarch64-neoverse_n1-1689770207.tar.gz to S3 bucket succeeded

@casparvl
Copy link
Collaborator Author

The failure on neoverse_v1:

0:42:15  10 out of 14 easyconfigs done: BLIS/0.9.0-GCC-12.2.0 (OK), Python/3.10.ERROR: Build of /cvmfs/pilot.eessi-hpc.org/versions/2023.06/software/linux/aarch64/neoverse_v1/software/EasyBuild/4.7.2/easybuild/easyconfigs/o/OpenBLAS/OpenBLAS-0.3.21-GCC-12.2.0.eb failed (err: 'build failed (first 300 chars): Too many LAPACK tests failed due to numerical errors: 344 (> 300)')

@bedroge
Copy link
Collaborator

bedroge commented Jul 19, 2023

The failure on neoverse_v1:

0:42:15  10 out of 14 easyconfigs done: BLIS/0.9.0-GCC-12.2.0 (OK), Python/3.10.ERROR: Build of /cvmfs/pilot.eessi-hpc.org/versions/2023.06/software/linux/aarch64/neoverse_v1/software/EasyBuild/4.7.2/easybuild/easyconfigs/o/OpenBLAS/OpenBLAS-0.3.21-GCC-12.2.0.eb failed (err: 'build failed (first 300 chars): Too many LAPACK tests failed due to numerical errors: 344 (> 300)')

So apparently the increase to 300 here is not enough for this version. Do we increase it a bit more (don't know how much sense it makes to just ignore more and more tests?)?

@casparvl
Copy link
Collaborator Author

I don't know either. I guess we can, but it does make one wonder: if numerical results are different, how different are they, and is that still acceptable? We should at the very least check that all failures are numerical failures, I guess.

I think the more fundamental question is: what should are tests guarantee?

  1. If they should only guarantee that the installation went ok, I think we can ignore the numerical errors: this is simply the behavior of this OpenBLAS version on neoverse_v1. "Fixing" that is not up to us.
  2. If they should guarantee that software produces the right result, we should not deploy it on neoverse_v1, and only deploy a version there once these numerical inconsistencies have been resolved (which it might never be if the OpenBLAS devs don't consider it a prolbem).

It's probably good to at least report an issue upstream, see what they say. I'd assume the devs are more adept at judging whether these numerical inconsistencies should be considered problematic or not. One issue with taking the first approach is that issues may also pop up in other packages, such as #306

@casparvl
Copy link
Collaborator Author

casparvl commented Jul 20, 2023

More detailed error:

                        -->   LAPACK TESTING SUMMARY  <--
SUMMARY                 nb test run     numerical error         other error
================        ===========     =================       ================
REAL                    1315683         107     (0.008%)        0       (0.000%)
DOUBLE PRECISION        1314777         66      (0.005%)        0       (0.000%)
COMPLEX                 773609          97      (0.013%)        0       (0.000%)
COMPLEX16               776246          74      (0.010%)        0       (0.000%)

--> ALL PRECISIONS      4180315         344     (0.008%)        0       (0.000%)


== 2023-07-19 12:16:15,506 openblas.py:113 INFO 4180315 LAPACK tests run - 344 failed due to numerical errors - 0 failed due to other errors

So it is all numerical failures. That is a little bit encouraging...

@casparvl
Copy link
Collaborator Author

casparvl commented Aug 4, 2023

I managed to go into the singularity container by unpacking the tarball at /mnt/shared/home/bot/eessi-bot-software-layer/jobs/2023.07/pr_309/5990/previous_tmp

doing a singularity shell ghcr.io_eessi_build_node_debian11.sif

and find the detailed testing results at bot/easybuild/build/OpenBLAS/0.3.21/GCC-12.2.0/OpenBLAS-0.3.21/lapack-netlib/TESTING/testing_results.txt. An excerpt:

...
Matrix order=   15, type=10, seed= 798,1691,2423, 745, result  5 is 8.389E+06
 Matrix order=   15, type=11, seed= 931,1787, 557,2429, result  5 is 8.389E+06
 Matrix order=   15, type=17, seed= 529,3615,1764,1221, result  5 is 8.389E+06
 Matrix order=   15, type=18, seed=3991, 625,3539,2581, result  5 is 8.389E+06
 Matrix order=   15, type=19, seed=2400,1821, 218,1365, result  5 is 8.389E+06
 Matrix order=   15, type=20, seed=2155,2073,3686, 149, result  5 is 8.389E+06
 Matrix order=   15, type=21, seed=3823,2194,3510,3029, result  5 is 8.389E+06
 Matrix order=   15, type=22, seed=2951, 608,1256,3481, result  5 is 8.389E+06
 Matrix order=   15, type=23, seed=1243,2927, 263, 357, result  5 is 8.389E+06
 Matrix order=   15, type=24, seed=1487,3956,2976,3649, result  5 is 8.389E+06
 Matrix order=   15, type=25, seed=2605,2211,3982,2721, result  5 is 8.389E+06
 Matrix order=   15, type=26, seed=   8,1600,1726,2817, result  5 is 8.389E+06
 Matrix order=   20, type= 8, seed= 624, 693,1681,   9, result  5 is 8.389E+06
 Matrix order=   20, type=10, seed= 563,2705, 428, 633, result  5 is 8.389E+06
 Matrix order=   20, type=12, seed= 112,1059,2403,2121, result  5 is 8.389E+06
 Matrix order=   20, type=13, seed=3448,3524, 410,2697, result  5 is 8.389E+06
 Matrix order=   20, type=14, seed= 248,   6,3354,3273, result  5 is 8.389E+06
 Matrix order=   20, type=17, seed= 723,4024,2193,2265, result  5 is 8.389E+06
 Matrix order=   20, type=18, seed= 828, 842,1787,3733, result  5 is 8.389E+06
 Matrix order=   20, type=19, seed= 252,3327,3300,1389, result  5 is 8.389E+06
 Matrix order=   20, type=20, seed=1904,2346,3199,2437, result  5 is 8.389E+06
 Matrix order=   20, type=21, seed=4066,  26,1253, 221, result  5 is 8.389E+06
 Matrix order=   20, type=22, seed=3234,3664,3237,3473, result  5 is 8.389E+06
 Matrix order=   20, type=23, seed= 481, 957,3230,3689, result  5 is 8.389E+06
 Matrix order=   20, type=24, seed=2871,2618,1987,2949, result  5 is 8.389E+06
 Matrix order=   20, type=25, seed=2604,3233, 644,3529, result  5 is 8.389E+06
 Matrix order=   20, type=26, seed=1465,2266,3976,3833, result  5 is 8.389E+06
 SGV drivers:     64 out of   1092 tests failed to pass the threshold
...
 Matrix order=   20, type=10, seed= 563,2705, 428, 633, result  5 is 4.504D+15
 Matrix order=   20, type=12, seed= 112,1059,2403,2121, result  5 is 4.504D+15
 Matrix order=   20, type=13, seed=3448,3524, 410,2697, result  5 is 4.504D+15
 Matrix order=   20, type=17, seed= 723,4024,2193,2265, result  5 is 4.504D+15
 Matrix order=   20, type=18, seed= 828, 842,1787,3733, result  5 is 4.504D+15
 Matrix order=   20, type=19, seed= 252,3327,3300,1389, result  5 is 4.504D+15
 Matrix order=   20, type=20, seed=1904,2346,3199,2437, result  5 is 4.504D+15
 Matrix order=   20, type=21, seed=4066,  26,1253, 221, result  5 is 4.504D+15
 Matrix order=   20, type=22, seed=3234,3664,3237,3473, result  5 is 4.504D+15
 Matrix order=   20, type=23, seed= 481, 957,3230,3689, result  5 is 4.504D+15
 Matrix order=   20, type=24, seed=2871,2618,1987,2949, result  5 is 4.504D+15
 Matrix order=   20, type=25, seed=2604,3233, 644,3529, result  5 is 4.504D+15
 Matrix order=   20, type=26, seed=1465,2266,3976,3833, result  5 is 4.504D+15
 DGV drivers:     34 out of   1092 tests failed to pass the threshold
...
 Matrix order=    6, type=16, seed= 852,1170,4018,2529, result  5 is 4.504D+15
 Matrix order=    8, type=20, seed= 202,3818,2830,2373, result  5 is 4.504D+15
 Matrix order=    8, type=25, seed= 582,1680, 289,1745, result  5 is 4.504D+15
 Matrix order=   10, type= 8, seed= 573,2931,1057, 529, result  5 is 4.504D+15
 Matrix order=   10, type=10, seed= 308, 212,3333,  73, result  5 is 4.504D+15
 Matrix order=   10, type=13, seed=3036,2372,2403,2421, result  5 is 4.504D+15
 Matrix order=   15, type=12, seed=2288,1863,3505, 805, result  5 is 4.504D+15
 Matrix order=   15, type=13, seed=3415,2880,2965,2889, result  5 is 4.504D+15
 Matrix order=   15, type=16, seed= 681,2778,2914,2225, result  5 is 4.504D+15
 Matrix order=   15, type=17, seed= 304,3863, 350,1657, result  5 is 4.504D+15
 Matrix order=   15, type=21, seed=2362, 980, 534,3365, result  5 is 4.504D+15
 Matrix order=   15, type=25, seed=2888,1855, 900,2225, result  5 is 4.504D+15
 Matrix order=   20, type= 8, seed= 636, 119,  47,2269, result  5 is 4.504D+15
 Matrix order=   20, type= 9, seed=1007,2268,2913,  29, result  5 is 4.504D+15
 Matrix order=   20, type=10, seed= 104,1378,2159,1485, result  5 is 4.504D+15
 Matrix order=   20, type=11, seed= 927, 874,3720,2349, result  5 is 4.504D+15
 Matrix order=   20, type=14, seed=2659,2147, 552,1661, result  5 is 4.504D+15
 Matrix order=   20, type=21, seed=2012,3064,3728,2789, result  5 is 4.504D+15
 DGV drivers:     18 out of   1092 tests failed to pass the threshold
...
 N=  30 M=   0, P=   5, type  8, test  2, ratio=  555331.
 N=   3 M=   0, P=  20, type  8, test  2, ratio= 0.889025E+07
 N=  30 M=   0, P=  20, type  8, test  2, ratio= 0.219820E+07
 M=   3 P=   0, N=  30, type  7, test  1, ratio= 0.800042E+07
 N=  30 M=   3, P=   0, type  7, test  1, ratio= 0.772279E+07
 M=   3 P=   5, N=  30, type  7, test  1, ratio= 0.852669E+07
 N=  30 M=   3, P=   5, type  8, test  2, ratio= 0.119398E+07
 N=   3 M=   3, P=  20, type  8, test  2, ratio= 0.998240E+07
 N=  30 M=   3, P=  20, type  8, test  2, ratio=  976876.
 N=   3 M=  10, P=   5, type  8, test  2, ratio= 0.121475E+08
 N=  30 M=  10, P=   5, type  7, test  1, ratio= 0.127652E+08
 N=  30 M=  10, P=   5, type  8, test  2, ratio=  714258.
 M=  10 P=  20, N=  30, type  7, test  1, ratio= 0.510405E+07
 N=  30 M=  10, P=  20, type  8, test  2, ratio= 0.208493E+07
 GQR:     14 out of   1728 tests failed to pass the threshold
...
 Matrix order=   20, type=16, seed= 647,2328,1944, 557, result  5 is 8.389E+06
 CGV drivers:      1 out of   1092 tests failed to pass the threshold
...
 CHESV , UPLO='U', N =    3, type  2, test  1, ratio = 0.10874E+07
 CHESVX, FACT='N', UPLO='U', N =    3, type  2, test  1, ratio = 0.10874E+07
 CHESV , UPLO='U', N =    5, type  9, test  1, ratio = 0.32042E+07
 CHESVX, FACT='N', UPLO='U', N =    5, type  9, test  1, ratio = 0.32042E+07
 CHESV , UPLO='U', N =    5, type 10, test  1, ratio = 0.35394E+07
 CHESVX, FACT='N', UPLO='U', N =    5, type 10, test  1, ratio = 0.35394E+07
 CHESV , UPLO='U', N =   10, type  2, test  1, ratio = 0.16458E+07
 CHESVX, FACT='N', UPLO='U', N =   10, type  2, test  1, ratio = 0.16458E+07
 CHESV , UPLO='U', N =   10, type  7, test  1, ratio = 0.14266E+06
 CHESVX, FACT='N', UPLO='U', N =   10, type  7, test  1, ratio = 0.14266E+06
 CHESV , UPLO='U', N =   10, type  9, test  1, ratio = 0.31372E+07
 CHESVX, FACT='N', UPLO='U', N =   10, type  9, test  1, ratio = 0.31372E+07
 CHESV , UPLO='U', N =   10, type 10, test  1, ratio = 0.16814E+07
 CHESVX, FACT='N', UPLO='U', N =   10, type 10, test  1, ratio = 0.16814E+07
 CHESV , UPLO='U', N =   50, type  2, test  1, ratio = 0.86168E+07
 CHESVX, FACT='N', UPLO='U', N =   50, type  2, test  1, ratio = 0.86168E+07
 CHESV , UPLO='U', N =   50, type  7, test  1, ratio = 0.80869E+06
 CHESVX, FACT='N', UPLO='U', N =   50, type  7, test  1, ratio = 0.80869E+06
 CHESV , UPLO='U', N =   50, type  8, test  1, ratio = 0.52818E+06
 CHESVX, FACT='N', UPLO='U', N =   50, type  8, test  1, ratio = 0.52818E+06
 CHESV , UPLO='U', N =   50, type  9, test  1, ratio = 0.85655E+07
 CHESVX, FACT='N', UPLO='U', N =   50, type  9, test  1, ratio = 0.85655E+07
 CHESV , UPLO='U', N =   50, type 10, test  1, ratio = 0.32095E+07
 CHESVX, FACT='N', UPLO='U', N =   50, type 10, test  1, ratio = 0.32095E+07
 CHE drivers:     24 out of   1072 tests failed to pass the threshold
...
 Matrix order=   12, type=24, seed=2693,2404,3046,2957, result  5 is 4.504D+15
 Matrix order=   12, type=25, seed= 578, 723, 929,1637, result  5 is 4.504D+15
 Matrix order=   12, type=26, seed=2061,1512,1968, 125, result  5 is 4.504D+15
 Matrix order=   20, type=16, seed= 647,2328,1944, 557, result  5 is 4.504D+15
 Matrix order=   20, type=17, seed=1585,3902,3906,1293, result  5 is 4.504D+15
 Matrix order=   20, type=18, seed=1063,2113,3640,2685, result  5 is 4.504D+15
 Matrix order=   20, type=19, seed=2305,3879, 305, 381, result  5 is 4.504D+15
 Matrix order=   20, type=20, seed= 897,3616, 121,2173, result  5 is 4.504D+15
 Matrix order=   20, type=21, seed= 676,3851,3089,3965, result  5 is 4.504D+15
 Matrix order=   20, type=22, seed=1763,2855,1954,1469, result  5 is 4.504D+15
 Matrix order=   20, type=23, seed=1338,3437,3180,3285, result  5 is 4.504D+15
 Matrix order=   20, type=24, seed=3522,1685,3785, 813, result  5 is 4.504D+15
 Matrix order=   20, type=25, seed=2931,3978,3195,3781, result  5 is 4.504D+15
 Matrix order=   20, type=26, seed= 449,1038,1670,1437, result  5 is 4.504D+15
 ZGV drivers:     53 out of   1092 tests failed to pass the threshold

Now, I'm no expert in these tests, and have no idea what this means. It does not look like small numerical errors to my untrained eye, but I might be completely wrong...

@boegel
Copy link
Contributor

boegel commented Aug 4, 2023

@casparvl To avoid getting stuck on this even longer, I think we should:

  1. Bump the limit for failing numerical tests for OpenBLAS to 350 or 400 in our hooks (see here), so the test step passes. The LAPACK test suite summary clearly show that just 0.008% of all (numerical) tests fail, I strongly feel that's not serious enough to block the installation, especially since this only happens for aarch64/neoverse_v1, and that we'll have a clear record of this in the EasyBuild log file that is included in the installation.

  2. Open an issue in this repository to keep track of the fact that we're seeing a bit more numerical failures for OpenBLAS on aarch64 than we do on amd or intel systems, with all the details (like the stuff you mentioned in your last comment, since this PR is not the right place to have this discussion imho), and also compare things across easyconfig generations/toolchains.

  3. Report this upstream to OpenBLAS, with sufficient detail, and keep track of the feedback in our own issue - we could probably use some help from @bartoldeman here if he can find the time for it, since he has some experience there.

We should take a similar approach to unblock other PRs (numpy in #306, FFTW in #297 and #310), and push on to discover other issues which no doubt will pop up.

We can't reasonable expect that we'll figure out each and every failing test for all software we'll install, especially because we known that some test suite are quite buggy themselves (PyTorch comes to mind, but this also applies to LAPACK, see @bartoldeman's PR which dealt with a non-numerical failure in the LAPACK test suite, see also issue #18017).

This procedure should probably be documented as well, and even become part of the contribution policy, with some rules of thumb on when this approach is acceptable (how many failing tests we see, on how many CPU targets, etc.).

@casparvl
Copy link
Collaborator Author

casparvl commented Aug 8, 2023

bot: build repo:eessi-2023.06-software arch:aarch64/neoverse_v1

@eessi-bot
Copy link

eessi-bot bot commented Aug 8, 2023

Updates by the bot instance eessi-bot-citc-aws (click for details)
  • received bot command build repo:eessi-2023.06-software arch:aarch64/neoverse_v1 from casparvl

    • expanded format: build repository:eessi-2023.06-software architecture:aarch64/neoverse_v1
  • handling command build repository:eessi-2023.06-software architecture:aarch64/neoverse_v1 resulted in:

@eessi-bot
Copy link

eessi-bot bot commented Aug 8, 2023

New job on instance eessi-bot-citc-aws for architecture aarch64-neoverse_v1 for repository eessi-2023.06-software in job dir /mnt/shared/home/bot/eessi-bot-software-layer/jobs/2023.08/pr_309/6354

date job status comment
Aug 08 10:50:25 UTC 2023 submitted job id 6354 awaits release by job manager
Aug 08 10:50:32 UTC 2023 released job awaits launch by Slurm scheduler
Aug 08 10:54:34 UTC 2023 running job 6354 is running
Aug 08 12:07:49 UTC 2023 finished
😁 SUCCESS (click triangle for details)
Details
✅ job output file slurm-6354.out
✅ no message matching ERROR:
✅ no message matching FAILED:
✅ no message matching required modules missing:
✅ found message(s) matching No missing installations
✅ found message matching .tar.gz created!
Artefacts
eessi-2023.06-software-linux-aarch64-neoverse_v1-1691496382.tar.gzsize: 136 MiB (143344312 bytes)
entries: 14510
modules under 2023.06/software/linux/aarch64/neoverse_v1/modules/all
BLIS/0.9.0-GCC-12.2.0.lua
CMake/3.24.3-GCCcore-12.2.0.lua
FFTW/3.3.10-GCC-12.2.0.lua
FFTW.MPI/3.3.10-gompi-2022b.lua
FlexiBLAS/3.2.1-GCC-12.2.0.lua
foss/2022b.lua
gompi/2022b.lua
libarchive/3.6.1-GCCcore-12.2.0.lua
libffi/3.4.4-GCCcore-12.2.0.lua
make/4.3-GCCcore-12.2.0.lua
OpenBLAS/0.3.21-GCC-12.2.0.lua
Python/3.10.8-GCCcore-12.2.0-bare.lua
ScaLAPACK/2.2.0-gompi-2022b-fb.lua
SQLite/3.39.4-GCCcore-12.2.0.lua
Tcl/8.6.12-GCCcore-12.2.0.lua
UnZip/6.0-GCCcore-12.2.0.lua
software under 2023.06/software/linux/aarch64/neoverse_v1/software
BLIS/0.9.0-GCC-12.2.0
CMake/3.24.3-GCCcore-12.2.0
FFTW/3.3.10-GCC-12.2.0
FFTW.MPI/3.3.10-gompi-2022b
FlexiBLAS/3.2.1-GCC-12.2.0
foss/2022b
gompi/2022b
libarchive/3.6.1-GCCcore-12.2.0
libffi/3.4.4-GCCcore-12.2.0
make/4.3-GCCcore-12.2.0
OpenBLAS/0.3.21-GCC-12.2.0
Python/3.10.8-GCCcore-12.2.0-bare
ScaLAPACK/2.2.0-gompi-2022b-fb
SQLite/3.39.4-GCCcore-12.2.0
Tcl/8.6.12-GCCcore-12.2.0
UnZip/6.0-GCCcore-12.2.0
other under 2023.06/software/linux/aarch64/neoverse_v1
.lmod/cache/spiderT.lua
.lmod/cache/spiderT.luac_5.1
.lmod/cache/timestamp
Aug 09 12:30:11 UTC 2023 uploaded transfer of eessi-2023.06-software-linux-aarch64-neoverse_v1-1691496382.tar.gz to S3 bucket succeeded

@casparvl
Copy link
Collaborator Author

casparvl commented Aug 8, 2023

Created issue for the OpenBLAS test failures:

#314

eb_hooks.py Show resolved Hide resolved
@boegel boegel changed the title {2023.06}[foss-2022b] Added foss-2022b {2023.06} foss/2022b Aug 8, 2023
@casparvl casparvl mentioned this pull request Aug 8, 2023
@boegel boegel added the bot:deploy Ask bot to deploy missing software installations to EESSI label Aug 9, 2023
@boegel boegel merged commit b694709 into EESSI:2023.06 Aug 9, 2023
trz42 pushed a commit to trz42/software-layer that referenced this pull request Apr 7, 2024
@casparvl casparvl deleted the foss_2022b branch August 15, 2024 15:03
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bot:deploy Ask bot to deploy missing software installations to EESSI pilot-2023.06
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants