{data}[fosscuda/2019b] JAX v0.1.77 w/ Python 3.7.4 #11595

Closed

edmondac
Contributor

@edmondac edmondac commented Oct 30, 2020

(created using eb --new-pr)

Requires easybuilders/easybuild-easyblocks#2262

@edmondac
Contributor Author

Test report by @bear-rsg
SUCCESS
Build succeeded for 1 out of 1 (1 easyconfigs in this PR)
bear-pg0212u27a.bear.cluster - Linux centos linux 7.8.2003, x86_64, Intel(R) Xeon(R) CPU E5-2640 v4 @ 2.40GHz, Python 2.7.5
See https://gist.github.com/0e2556e18868fdd34d432a294b74aba2 for a full test report.

@edmondac
Contributor Author

Note bear-rsg#371 - which might need a Bazel fix pushing upstream

@surak
Contributor

surak commented Oct 30, 2020

I have added the static setting to Bazel, and in fact I get a bazel without the dynamic link to libstdc++:

 ldd /p/software/juwels/stages/Devel-2020/software/Bazel/3.6.0-GCCcore-9.3.0/bin/bazel 
	linux-vdso.so.1 (0x00007ffd4aaf2000)
	librt.so.1 => /usr/lib64/librt.so.1 (0x0000153a2b185000)
	libdl.so.2 => /usr/lib64/libdl.so.2 (0x0000153a2af81000)
	libpthread.so.0 => /usr/lib64/libpthread.so.0 (0x0000153a2ad61000)
	libm.so.6 => /usr/lib64/libm.so.6 (0x0000153a2a9df000)
	libc.so.6 => /usr/lib64/libc.so.6 (0x0000153a2a61d000)
	/lib64/ld-linux-x86-64.so.2 (0x0000153a2b38e000)

However, I hit the libstdc++ problem further down the road. A more complete log is here:

jax-ml/jax#4743
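
The check above (reading `ldd` output to confirm that the binary has no dynamic link to libstdc++) can be scripted. The following is only a minimal sketch: the helper names `linked_libraries` and `links_dynamically_to` are made up for illustration, and the sample text is the `ldd` output quoted in this comment.

```python
# Sketch only: helper names are hypothetical, not part of EasyBuild or Bazel.

def linked_libraries(ldd_output):
    """Return the library names listed in the text printed by `ldd`."""
    names = []
    for line in ldd_output.splitlines():
        line = line.strip()
        if not line:
            continue
        # Lines look like "libm.so.6 => /usr/lib64/libm.so.6 (0x...)",
        # "linux-vdso.so.1 (0x...)" or "/lib64/ld-linux-x86-64.so.2 (0x...)".
        first_field = line.split()[0]
        # Keep only the file name if the first field is a full path.
        names.append(first_field.rsplit("/", 1)[-1])
    return names


def links_dynamically_to(ldd_output, prefix="libstdc++"):
    """True if any listed library name starts with the given prefix."""
    return any(name.startswith(prefix) for name in linked_libraries(ldd_output))


# ldd output for the bazel binary, as quoted in the comment above.
SAMPLE = """
    linux-vdso.so.1 (0x00007ffd4aaf2000)
    librt.so.1 => /usr/lib64/librt.so.1 (0x0000153a2b185000)
    libdl.so.2 => /usr/lib64/libdl.so.2 (0x0000153a2af81000)
    libpthread.so.0 => /usr/lib64/libpthread.so.0 (0x0000153a2ad61000)
    libm.so.6 => /usr/lib64/libm.so.6 (0x0000153a2a9df000)
    libc.so.6 => /usr/lib64/libc.so.6 (0x0000153a2a61d000)
    /lib64/ld-linux-x86-64.so.2 (0x0000153a2b38e000)
"""

if __name__ == "__main__":
    print(links_dynamically_to(SAMPLE))
```

This only inspects the bazel binary itself. It says nothing about binaries that Bazel builds later, which is where the libstdc++ problem reportedly shows up.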

@edmondac
Contributor Author

I'll try uploading some EL8 test reports on haswell and POWER9...

@surak
Contributor

surak commented Oct 30, 2020

> I'll try uploading some EL8 test reports on haswell and POWER9...

OK, the problem was my 0.2.5 build. I went with 0.1.77 and it installs.

@edmondac
Contributor Author

Test report by @bear-rsg
SUCCESS
Build succeeded for 51 out of 51 (1 easyconfigs in this PR)
bear-pg0212u15b.bear.cluster - Linux centos linux 8.2.2004, x86_64, Intel(R) Xeon(R) CPU E5-2640 v4 @ 2.40GHz, Python 3.6.8
See https://gist.github.com/1db6f62ad2e51995c055202d4d5696c8 for a full test report.

@boegel added the new label Nov 2, 2020
@boegel added this to the 4.x milestone Nov 2, 2020
@surak
Contributor

surak commented Nov 13, 2020

My suggestion comes from @Flamefire, who had the same problem with TensorFlow on CentOS 7. I didn't see this issue before because on CentOS 8 the ld.gold is roughly the same version as the one in our toolchain. See tensorflow/tensorflow#39263

@boegelbot
Collaborator

@edmondac: Tests failed in GitHub Actions, see https://github.com/easybuilders/easybuild-easyconfigs/actions/runs/397069336
Output from first failing test suite run:

ERROR: test__parse_easyconfig_JAX-0.1.77-fosscuda-2019b-Python-3.7.4.eb (test.easyconfigs.easyconfigs.EasyConfigTest)
Test for parsing of easyconfig JAX-0.1.77-fosscuda-2019b-Python-3.7.4.eb
----------------------------------------------------------------------
Traceback (most recent call last):
  File "test/easyconfigs/easyconfigs.py", line 1041, in innertest
    template_easyconfig_test(self, spec_path)
  File "test/easyconfigs/easyconfigs.py", line 837, in template_easyconfig_test
    ecs = process_easyconfig(spec)
  File "/opt/hostedtoolcache/Python/2.7.18/x64/lib/python2.7/site-packages/easybuild/framework/easyconfig/easyconfig.py", line 1977, in process_easyconfig
    raise EasyBuildError("Failed to process easyconfig %s: %s", spec, err.msg)
EasyBuildError: "Failed to process easyconfig /home/runner/work/easybuild-easyconfigs/easybuild-easyconfigs/easybuild/easyconfigs/j/JAX/JAX-0.1.77-fosscuda-2019b-Python-3.7.4.eb: No software-specific easyblock 'EB_JAX' found for JAX"

======================================================================
FAIL: test_changed_files_pull_request (test.easyconfigs.easyconfigs.EasyConfigTest)
Specific checks only done for the (easyconfig) files that were changed in a pull request.
----------------------------------------------------------------------
Traceback (most recent call last):
  File "test/easyconfigs/easyconfigs.py", line 805, in test_changed_files_pull_request
    self.assertTrue(False, error_msg)
AssertionError: Failed to find parsed easyconfig for JAX-0.1.77-fosscuda-2019b-Python-3.7.4.eb (and could not isolate it in easyconfigs archive either)

----------------------------------------------------------------------
Ran 10993 tests in 874.636s

FAILED (failures=1, errors=1)
ERROR: Not all tests were successful.

bleep, bloop, I'm just a bot (boegelbot v20200716.01)
Please talk to my owner @boegel if you notice me acting stupid,
or submit a pull request to https://github.com/boegel/boegelbot to fix the problem.

@edmondac
Contributor Author

edmondac commented Dec 3, 2020

I think easybuilders/easybuild-easyblocks#2262 is required before the CI will pass
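
For readers unfamiliar with the CI error above: the message suggests that, when an easyconfig does not name a generic easyblock, a software-specific class called `EB_<name>` is looked up, and parsing fails if none is available. The sketch below is a toy model of that lookup, not EasyBuild's actual code. `class_name_for`, `resolve_easyblock` and `EasyblockNotFound` are hypothetical names, and the real name encoding also handles characters such as `-` and `+`, which this sketch does not attempt.

```python
# Toy model of the lookup implied by the CI error message; all names here
# are hypothetical and do not correspond to EasyBuild's real API.

EASYBLOCK_PREFIX = "EB_"  # prefix seen in the error message ('EB_JAX')


class EasyblockNotFound(Exception):
    """Raised when no software-specific easyblock is available."""


def class_name_for(software_name):
    """Derive the expected class name (simplified: no special characters)."""
    return EASYBLOCK_PREFIX + software_name


def resolve_easyblock(software_name, available, generic=None):
    """Pick an easyblock: an explicitly named generic one wins, otherwise
    fall back to the software-specific class name if it is available."""
    if generic is not None:
        return generic
    name = class_name_for(software_name)
    if name in available:
        return name
    raise EasyblockNotFound(
        "No software-specific easyblock '%s' found for %s" % (name, software_name)
    )


if __name__ == "__main__":
    # Before the easyblock is available: lookup fails, as in the CI log.
    try:
        resolve_easyblock("JAX", {"EB_Bazel"})
    except EasyblockNotFound as err:
        print(err)
    # Once an EB_JAX easyblock is available, the lookup succeeds.
    print(resolve_easyblock("JAX", {"EB_Bazel", "EB_JAX"}))
```

Under this model, the easyconfig cannot parse in CI until the easyblocks the CI runs against include an `EB_JAX` class, which matches the dependency on the easyblocks PR noted in this comment.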

@boegelbot
Collaborator

@edmondac: Tests failed in GitHub Actions, see https://github.com/easybuilders/easybuild-easyconfigs/actions/runs/398095163
Output from first failing test suite run:

ERROR: test__parse_easyconfig_Bazel-2.0.0-GCCcore-8.3.0.eb (test.easyconfigs.easyconfigs.EasyConfigTest)
Test for parsing of easyconfig Bazel-2.0.0-GCCcore-8.3.0.eb
----------------------------------------------------------------------
Traceback (most recent call last):
  File "test/easyconfigs/easyconfigs.py", line 1041, in innertest
    template_easyconfig_test(self, spec_path)
  File "test/easyconfigs/easyconfigs.py", line 837, in template_easyconfig_test
    ecs = process_easyconfig(spec)
  File "/opt/hostedtoolcache/Python/2.7.18/x64/lib/python2.7/site-packages/easybuild/framework/easyconfig/easyconfig.py", line 1977, in process_easyconfig
    raise EasyBuildError("Failed to process easyconfig %s: %s", spec, err.msg)
EasyBuildError: "Failed to process easyconfig /home/runner/work/easybuild-easyconfigs/easybuild-easyconfigs/easybuild/easyconfigs/b/Bazel/Bazel-2.0.0-GCCcore-8.3.0.eb: Use of 1 unknown easyconfig parameters detected in Bazel-2.0.0-GCCcore-8.3.0.eb: static\nIf these are just local variables please rename them to start with 'local_', or try using --fix-deprecated-easyconfigs to do this automatically.\nFor more information, see https://easybuild.readthedocs.io/en/latest/Easyconfig-files-local-variables.html ."

======================================================================
ERROR: test__parse_easyconfig_JAX-0.1.77-fosscuda-2019b-Python-3.7.4.eb (test.easyconfigs.easyconfigs.EasyConfigTest)
Test for parsing of easyconfig JAX-0.1.77-fosscuda-2019b-Python-3.7.4.eb
----------------------------------------------------------------------
Traceback (most recent call last):
  File "test/easyconfigs/easyconfigs.py", line 1041, in innertest
    template_easyconfig_test(self, spec_path)
  File "test/easyconfigs/easyconfigs.py", line 837, in template_easyconfig_test
    ecs = process_easyconfig(spec)
  File "/opt/hostedtoolcache/Python/2.7.18/x64/lib/python2.7/site-packages/easybuild/framework/easyconfig/easyconfig.py", line 1977, in process_easyconfig
    raise EasyBuildError("Failed to process easyconfig %s: %s", spec, err.msg)
EasyBuildError: "Failed to process easyconfig /home/runner/work/easybuild-easyconfigs/easybuild-easyconfigs/easybuild/easyconfigs/j/JAX/JAX-0.1.77-fosscuda-2019b-Python-3.7.4.eb: No software-specific easyblock 'EB_JAX' found for JAX"

======================================================================
ERROR: test_dep_versions_per_toolchain_generation (test.easyconfigs.easyconfigs.EasyConfigTest)
----------------------------------------------------------------------
Traceback (most recent call last):
  File "test/easyconfigs/easyconfigs.py", line 441, in test_dep_versions_per_toolchain_generation
    self.process_all_easyconfigs()
  File "test/easyconfigs/easyconfigs.py", line 121, in process_all_easyconfigs
    EasyConfigTest.parsed_easyconfigs, modules_tool(), retain_all_deps=True)
  File "/opt/hostedtoolcache/Python/2.7.18/x64/lib/python2.7/site-packages/easybuild/tools/robot.py", line 436, in resolve_dependencies
    processed_ecs = process_easyconfig(path, validate=not retain_all_deps, hidden=hidden)
  File "/opt/hostedtoolcache/Python/2.7.18/x64/lib/python2.7/site-packages/easybuild/framework/easyconfig/easyconfig.py", line 1977, in process_easyconfig
    raise EasyBuildError("Failed to process easyconfig %s: %s", spec, err.msg)
EasyBuildError: "Failed to process easyconfig /home/runner/work/easybuild-easyconfigs/easybuild-easyconfigs/easybuild/easyconfigs/b/Bazel/Bazel-2.0.0-GCCcore-8.3.0.eb: Use of 1 unknown easyconfig parameters detected in Bazel-2.0.0-GCCcore-8.3.0.eb: static\nIf these are just local variables please rename them to start with 'local_', or try using --fix-deprecated-easyconfigs to do this automatically.\nFor more information, see https://easybuild.readthedocs.io/en/latest/Easyconfig-files-local-variables.html ."

======================================================================
ERROR: test_sanity_check_paths (test.easyconfigs.easyconfigs.EasyConfigTest)
Make sure specified sanity check paths adher to the requirements.
----------------------------------------------------------------------
Traceback (most recent call last):
  File "test/easyconfigs/easyconfigs.py", line 504, in test_sanity_check_paths
    self.process_all_easyconfigs()
  File "test/easyconfigs/easyconfigs.py", line 121, in process_all_easyconfigs
    EasyConfigTest.parsed_easyconfigs, modules_tool(), retain_all_deps=True)
  File "/opt/hostedtoolcache/Python/2.7.18/x64/lib/python2.7/site-packages/easybuild/tools/robot.py", line 436, in resolve_dependencies
    processed_ecs = process_easyconfig(path, validate=not retain_all_deps, hidden=hidden)
  File "/opt/hostedtoolcache/Python/2.7.18/x64/lib/python2.7/site-packages/easybuild/framework/easyconfig/easyconfig.py", line 1977, in process_easyconfig
    raise EasyBuildError("Failed to process easyconfig %s: %s", spec, err.msg)
EasyBuildError: "Failed to process easyconfig /home/runner/work/easybuild-easyconfigs/easybuild-easyconfigs/easybuild/easyconfigs/b/Bazel/Bazel-2.0.0-GCCcore-8.3.0.eb: Use of 1 unknown easyconfig parameters detected in Bazel-2.0.0-GCCcore-8.3.0.eb: static\nIf these are just local variables please rename them to start with 'local_', or try using --fix-deprecated-easyconfigs to do this automatically.\nFor more information, see https://easybuild.readthedocs.io/en/latest/Easyconfig-files-local-variables.html ."

======================================================================
FAIL: test_changed_files_pull_request (test.easyconfigs.easyconfigs.EasyConfigTest)
Specific checks only done for the (easyconfig) files that were changed in a pull request.
----------------------------------------------------------------------
Traceback (most recent call last):
  File "test/easyconfigs/easyconfigs.py", line 805, in test_changed_files_pull_request
    self.assertTrue(False, error_msg)
AssertionError: Failed to find parsed easyconfig for Bazel-2.0.0-GCCcore-8.3.0.eb (and could not isolate it in easyconfigs archive either)

----------------------------------------------------------------------
Ran 11003 tests in 368.157s

FAILED (failures=1, errors=4)
ERROR: Not all tests were successful.

bleep, bloop, I'm just a bot (boegelbot v20200716.01)
Please talk to my owner @boegel if you notice me acting stupid,
or submit a pull request to https://github.com/boegel/boegelbot to fix the problem.
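
The `static` errors in this run follow from a rule spelled out in the message itself: any name assigned in an easyconfig that is not a known parameter is rejected unless it starts with `local_`. Presumably the easyblocks the CI ran against did not define `static` for Bazel. A minimal sketch of that rule, assuming a tiny illustrative set of known parameters (`KNOWN_PARAMS` and `unknown_parameters` are hypothetical names, and real EasyBuild knows many more parameters):

```python
# Simplified model of the check described in the error message; the names
# and the parameter set below are illustrative assumptions only.

KNOWN_PARAMS = {"name", "version", "toolchain", "sources", "dependencies"}


def unknown_parameters(easyconfig_vars, known=KNOWN_PARAMS):
    """Return names that are neither known parameters nor 'local_' variables."""
    return sorted(
        key for key in easyconfig_vars
        if key not in known and not key.startswith("local_")
    )


if __name__ == "__main__":
    ec = {"name": "Bazel", "version": "2.0.0", "static": True}
    # 'static' is flagged while it is not a known parameter.
    print(unknown_parameters(ec))
    # If 'static' is treated as a known parameter, nothing is flagged.
    print(unknown_parameters(ec, known=KNOWN_PARAMS | {"static"}))
```

In this model the Bazel easyconfig stops being flagged once `static` becomes a known parameter; renaming it to `local_static` would also silence the check, but would turn it into a plain local variable rather than a build setting.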

@edmondac
Contributor Author

edmondac commented Dec 3, 2020

Test report by @bear-rsg
SUCCESS
Build succeeded for 2 out of 2 (2 easyconfigs in total)
bear-pg0306u19a - Linux RHEL 8.2, POWER, 8335-GTX, Python 3.6.8
See https://gist.github.com/7cd7ec68c6b92f45843b7ca27c26c5e8 for a full test report.

@edmondac
Contributor Author

edmondac commented Dec 3, 2020

Test report by @bear-rsg
SUCCESS
Build succeeded for 2 out of 2 (2 easyconfigs in total)
bear-pg0305u05a.bear.cluster - Linux RHEL 7.6, POWER, 8335-GTX, Python 3.6.8
See https://gist.github.com/ee41677edc98dee8648002cb546cb508 for a full test report.

@edmondac
Contributor Author

edmondac commented Dec 3, 2020

Test report by @bear-rsg
SUCCESS
Build succeeded for 2 out of 2 (2 easyconfigs in total)
bear-pg0212u15b.bear.cluster - Linux centos linux 8.2.2004, x86_64, Intel(R) Xeon(R) CPU E5-2640 v4 @ 2.40GHz, Python 3.6.8
See https://gist.github.com/a678a7a38383fdb33f8f9be192b396da for a full test report.

@branfosj
Member

closing - we'll go with #13622 instead

@branfosj closed this Aug 12, 2021

6 participants