Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

{chem}[foss/2021a,foss/2021b] LAMMPS v23Jun2022 w/ Python 3.9.5 + 3.9.6 and CUDA #15900

Merged

Conversation

ocaisa
Copy link
Member

@ocaisa ocaisa commented Jul 22, 2022

…, LAMMPS-23Jun2022-foss-2021b-kokkos-CUDA-11.4.1.eb
@ocaisa ocaisa added the update label Jul 22, 2022
@ocaisa
Copy link
Member Author

ocaisa commented Jul 22, 2022

In #14815 a patch was introduced to LAMMPS to force GPU-aware MPI use. However, with my testing of the new approach to CUDA in OpenMPI (#15528) this patch is no longer necessary.

If we do need a patch, I think the right approach is in lammps/lammps#3140 (comment)

@ocaisa ocaisa changed the title {chem}[foss/2021a,foss/2021b] LAMMPS v23Jun2022 w/ Python 3.9.5 + 3.9.6 {chem}[foss/2021a,foss/2021b] LAMMPS v23Jun2022 w/ Python 3.9.5 + 3.9.6 and CUDA Jul 22, 2022
@ocaisa
Copy link
Member Author

ocaisa commented Jul 22, 2022

@boegelbot please test @ generoso EB_ARGS="--cuda-compute-capabilities=8.0"

@boegelbot
Copy link
Collaborator

@ocaisa: Request for testing this PR well received on login1

PR test command 'EB_PR=15900 EB_ARGS="--cuda-compute-capabilities=8.0" /opt/software/slurm/bin/sbatch --job-name test_PR_15900 --ntasks=4 ~/boegelbot/eb_from_pr_upload_generoso.sh' executed!

  • exit code: 0
  • output:
Submitted batch job 8895

Test results coming soon (I hope)...

- notification for comment with ID 1192685038 processed

Message to humans: this is just bookkeeping information for me,
it is of no use to you (unless you think I have a bug, which I don't).

@ocaisa
Copy link
Member Author

ocaisa commented Jul 22, 2022

Seems like the test suite is broken for SYSTEM toolchain: https://github.com/easybuilders/easybuild-easyconfigs/blob/develop/test/easyconfigs/easyconfigs.py#L1421

@boegelbot
Copy link
Collaborator

Test report by @boegelbot
FAILED
Build succeeded for 0 out of 2 (2 easyconfigs in total)
cns1 - Linux Rocky Linux 8.5, x86_64, Intel(R) Xeon(R) CPU E5-2667 v3 @ 3.20GHz (haswell), Python 3.6.8
See https://gist.github.com/2134ddf31fc7a514e5ee5631620e9598 for a full test report.

@ocaisa
Copy link
Member Author

ocaisa commented Jul 25, 2022

@boegelbot please test @ jsc-zen2-v100

@boegelbot
Copy link
Collaborator

@ocaisa: Request for testing this PR well received on jsczen2l1.int.jsc-zen2.easybuild-test.cluster

PR test command 'EB_PR=15900 EB_ARGS= /opt/software/slurm/bin/sbatch --job-name test_PR_15900 --ntasks=8 ~/boegelbot/eb_from_pr_upload_jsc-zen2.sh' executed!

  • exit code: 0
  • output:
Submitted batch job 1407

Test results coming soon (I hope)...

- notification for comment with ID 1193859242 processed

Message to humans: this is just bookkeeping information for me,
it is of no use to you (unless you think I have a bug, which I don't).

@ocaisa
Copy link
Member Author

ocaisa commented Jul 25, 2022

Test report by @ocaisa
Using easyblocks from PR(s) easybuilders/easybuild-easyblocks#2213
FAILED
Build succeeded for 0 out of 1 (1 easyconfigs in total)
gpunode1.int.eessi-gpu.learnhpc.eu - Linux Rocky Linux 8.5 (Green Obsidian), x86_64, AMD EPYC 7742 64-Core Processor (zen2), 1 x NVIDIA GRID V100-4C, 460.73.01, Python 3.9.9
See https://gist.github.com/3ff1aa7a72c3dbb183486a9225de156f for a full test report.

EDIT
This failure is due to EESSI filtering out LD_LIBRARYPATH, I've tried to make detection more robust in easybuilders/easybuild-easyblocks#2765

@boegelbot
Copy link
Collaborator

boegelbot commented Jul 25, 2022

Test report by @boegelbot
FAILED
Build succeeded for 1 out of 3 (2 easyconfigs in total)
jsczen2c1.int.jsc-zen2.easybuild-test.cluster - Linux Rocky Linux 8.5, x86_64, AMD EPYC 7742 64-Core Processor (zen2), Python 3.6.8
See https://gist.github.com/668963347025fa98a485f966ac3dba21 for a full test report.

EDIT
These failures are only because the builds were run on a node without GPUs, which causes the sanity checks to fail. The builds themselves were successful

@ocaisa
Copy link
Member Author

ocaisa commented Jul 25, 2022

Test report by @ocaisa
Using easyblocks from PR(s) easybuilders/easybuild-easyblocks#2765
SUCCESS
Build succeeded for 1 out of 1 (1 easyconfigs in total)
gpunode1.int.eessi-gpu.learnhpc.eu - Linux Rocky Linux 8.5 (Green Obsidian), x86_64, AMD EPYC 7742 64-Core Processor (zen2), 1 x NVIDIA GRID V100-4C, 460.73.01, Python 3.9.9
See https://gist.github.com/0a1f3682ccce13abeb5ae8dde3823a31 for a full test report.

@branfosj
Copy link
Member

Test report by @branfosj
Using easyblocks from PR(s) easybuilders/easybuild-easyblocks#2765
SUCCESS
Build succeeded for 2 out of 2 (2 easyconfigs in total)
bear-pg0103u11a.bear.cluster - Linux RHEL 8.6, x86_64, Intel(R) Xeon(R) Gold 6330 CPU @ 2.00GHz (icelake), 1 x NVIDIA NVIDIA A100-PCIE-40GB, 470.57.02, Python 3.6.8
See https://gist.github.com/ff3f873178d9ed33b3c05d27a4573c51 for a full test report.

@SebastianAchilles
Copy link
Member

@boegelbot please test @ jsc-zen2
EB_ARGS="--include-easyblocks-from-pr=2765"

@boegelbot
Copy link
Collaborator

@SebastianAchilles: Request for testing this PR well received on jsczen2l1.int.jsc-zen2.easybuild-test.cluster

PR test command 'EB_PR=15900 EB_ARGS="--include-easyblocks-from-pr=2765" /opt/software/slurm/bin/sbatch --job-name test_PR_15900 --ntasks=8 ~/boegelbot/eb_from_pr_upload_jsc-zen2.sh' executed!

  • exit code: 0
  • output:
Submitted batch job 1408

Test results coming soon (I hope)...

- notification for comment with ID 1195923289 processed

Message to humans: this is just bookkeeping information for me,
it is of no use to you (unless you think I have a bug, which I don't).

@SebastianAchilles
Copy link
Member

Test report by @SebastianAchilles
Using easyblocks from PR(s) easybuilders/easybuild-easyblocks#2765
SUCCESS
Build succeeded for 2 out of 2 (2 easyconfigs in total)
bdw-opensuse-154 - Linux openSUSE Leap 15.4, x86_64, Intel(R) Core(TM) i7-6900K CPU @ 3.20GHz (broadwell), 2 x NVIDIA NVIDIA GeForce GTX 1060 6GB, 510.73.08, Python 3.6.15
See https://gist.github.com/5ee4874416123169c2720d77257f899f for a full test report.

@boegelbot
Copy link
Collaborator

Test report by @boegelbot
Using easyblocks from PR(s) easybuilders/easybuild-easyblocks#2765
SUCCESS
Build succeeded for 2 out of 2 (2 easyconfigs in total)
jsczen2c1.int.jsc-zen2.easybuild-test.cluster - Linux Rocky Linux 8.5, x86_64, AMD EPYC 7742 64-Core Processor (zen2), Python 3.6.8
See https://gist.github.com/3ad9143f2a611ab3065b78a9cf59fac5 for a full test report.

Copy link
Member

@SebastianAchilles SebastianAchilles left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

lgtm

Can be merged once easybuilders/easybuild-easyblocks#2765 is merged.

@SebastianAchilles
Copy link
Member

Test report by @SebastianAchilles
Using easyblocks from PR(s) easybuilders/easybuild-easyblocks#2765
SUCCESS
Build succeeded for 2 out of 2 (2 easyconfigs in total)
bdw-opensuse-154 - Linux openSUSE Leap 15.4, x86_64, Intel(R) Core(TM) i7-6900K CPU @ 3.20GHz (broadwell), 2 x NVIDIA NVIDIA GeForce GTX 1060 6GB, 510.85.02, Python 3.6.15
See https://gist.github.com/81b3672c806e072e5f13021f73f1ed35 for a full test report.

@boegel
Copy link
Member

boegel commented Aug 29, 2022

Test report by @boegel
SUCCESS
Build succeeded for 3 out of 3 (2 easyconfigs in total)
node3127.skitty.os - Linux RHEL 8.4, x86_64, Intel(R) Xeon(R) Gold 6140 CPU @ 2.30GHz (skylake_avx512), Python 3.6.8
See https://gist.github.com/40ff2db9461f0749678f1ed32d541031 for a full test report.

@boegel boegel modified the milestones: 4.x, next release (4.6.1?) Aug 29, 2022
Copy link
Member

@boegel boegel left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

lgtm

@boegel
Copy link
Member

boegel commented Aug 29, 2022

Going in, thanks @ocaisa!

@boegel boegel merged commit 5b57d57 into easybuilders:develop Aug 29, 2022
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
None yet
Development

Successfully merging this pull request may close these issues.

5 participants