[CI] Make Graviton3 default AArch64 job runner node #15352

Mousius · 2023-07-18T16:30:30Z

In order to support SVE testing, migrating the current default AArch64 nodes to Graviton3 based nodes. Using r7g.large instances which have the memory requirements to support the TVM workloads.

tvm-bot · 2023-07-18T16:30:33Z

Thanks for contributing to TVM! Please refer to the contributing guidelines https://tvm.apache.org/docs/contribute/ for useful information and tips. Please request code reviews from Reviewers by @-ing them in a comment.

cc @areusch, @leandron _{See #10317 for details}

_{Generated by tvm-bot}

tqchen · 2023-07-18T17:33:08Z

ci/jenkins/templates/arm_jenkinsfile.groovy.j2

@@ -19,7 +19,7 @@

 {% call m.invoke_build(
  name='BUILD: arm',
-  node='ARM-SMALL',
+  node='ARM-GRAVITON3',


Thanks, it would be useful to have a analysis of cost and the way we structure the tests.

As of now running the UTs directly through e2e compilation can take up a lot of CI time. A lot of that comes from tests that are as a matter of fact integration tests

Would be great for us to isolate out a limited set of integration tests (in cases with tests/arm_sve/) and only run limited set of testcases over these would be useful. Like our require_cuda tag, while majority of tests do not have to go through the specific nodes

tqchen · 2023-07-18T17:35:56Z

Thanks, it would be useful to have a analysis of cost of the new instance.

As of now running the UTs directly through e2e compilation can take up a lot of CI time. A lot of that comes from tests that likely do not need SVE.

My understanding is that we will need SVE for some of the integration tests. Ideally we should isolate out a limited set of integration tests(e.g. via tests/arm_sve/) and only run those. We can also have tests can be enabled through require_sve tag, that optionally disables the related tests when they are not available. As such majority of tests do not have to go through the specific hw.

Most remainder of the tests can be structured through UTs and likely do not need SVE

Mousius · 2023-07-18T21:41:43Z

@tqchen the new instance type is slightly more expensive on paper, as detailed below:

CI	Instance Type	On-Demand	Reserved	Minimum Spot
`ARM-GRAVITON3`	r7g.large	$87.5270 monthly	$57.8890 monthly	$49.1290 monthly
`ARM-SMALL`	r6g.large	$82.3440 monthly	$51.9030 monthly	$48.0340 monthly

However, the new generation of instance has been proven to improve performance (see: Re:invent presentation). Which indicates this is an improvement for CI costs.

If you look at the diff, this replaces the r6g.large instances with r7g.large instances and does not add any additional nodes, replacing the existing AArch64 CI with a new instance type that can also run the SVE tests. That means the SVE tests will already be targeted appropriately to AArch64, and this isn't an entirely new set of nodes specifically catering to just SVE.

tqchen · 2023-07-18T22:07:27Z

OK get it, seems to be good on this

[CI] Make Graviton3 default AArch64 job runner node

c48d0e3

In order to support SVE testing, migrating the current default AArch64 nodes to Graviton3 based nodes. Using r7g.large instances which have the memory requirements to support the TVM workloads.

tqchen requested changes Jul 18, 2023

View reviewed changes

tqchen approved these changes Jul 18, 2023

View reviewed changes

ashutosh-arm merged commit 0603cce into main Jul 19, 2023

junrushao deleted the Make_Graviton3_default_AArch64_job_runner_node branch July 21, 2023 06:27

ysh329 mentioned this pull request Oct 18, 2023

[Release] v0.14.0 Release Candidate Notes #15948

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[CI] Make Graviton3 default AArch64 job runner node #15352

[CI] Make Graviton3 default AArch64 job runner node #15352

Mousius commented Jul 18, 2023

tvm-bot commented Jul 18, 2023

tqchen Jul 18, 2023

tqchen commented Jul 18, 2023 •

edited

Loading

Mousius commented Jul 18, 2023

tqchen commented Jul 18, 2023

[CI] Make Graviton3 default AArch64 job runner node #15352

[CI] Make Graviton3 default AArch64 job runner node #15352

Conversation

Mousius commented Jul 18, 2023

tvm-bot commented Jul 18, 2023

tqchen Jul 18, 2023

Choose a reason for hiding this comment

tqchen commented Jul 18, 2023 • edited Loading

Mousius commented Jul 18, 2023

tqchen commented Jul 18, 2023

tqchen commented Jul 18, 2023 •

edited

Loading