[SYCL][CUDA] Add no-fast-math to tests that rely on it. #9889

JackAKirk · 2023-06-14T21:38:42Z

Same as #9419. This updates a few tests that were missed where the fast-math flag affects at least the cuda backend. These tests assume no-fast-math precision.

Signed-off-by: JackAKirk <jack.kirk@codeplay.com>

aelovikov-intel · 2023-06-14T22:33:09Z

@JackAKirk , can you please merge latest origin/sycl? There were some changes with the lint CI task today, I hope the testing would work after the merge.

JackAKirk · 2023-06-15T10:31:58Z

@JackAKirk , can you please merge latest origin/sycl? There were some changes with the lint CI task today, I hope the testing would work after the merge.

Thanks, I've done this now.

Signed-off-by: JackAKirk <jack.kirk@codeplay.com>

JackAKirk · 2023-07-10T09:29:19Z

@cperkinsintel Would you be able to review this please?

aelovikov-intel · 2023-07-10T14:49:53Z

Modified E2E tests pass in the pre-commit.

cperkinsintel · 2024-01-03T19:04:25Z

sycl/test-e2e/BFloat16/bfloat16_builtins.cpp

 // REQUIRES: aspect-ext_oneapi_bfloat16_math_functions
-// RUN: %clangxx -fsycl -fsycl-targets=%{sycl_triple} %if any-device-is-cuda %{ -Xsycl-target-backend --cuda-gpu-arch=sm_80 %} %s -o %t.out
+// RUN: %clangxx -fsycl -fsycl-targets=%{sycl_triple} %if any-device-is-cuda %{ -Xsycl-target-backend --cuda-gpu-arch=sm_80 %} %s -o %t.out %{mathflags}


@JackAKirk - I'm working with this test and on our slightly older shared CUDA dev machines sm_80 gets rejected. But with sm_75 the test both compiles and behaves as expected.

Can I just switch this to sm_75 ? Or, better yet, given that we have a REQUIRES: aspect-ext_oneapi_bfloat16_math_functions in this test, can the whole %if any-device-is-cuda ... %} block be removed?

The test behaves differently depending on whether it is compiled for sm_xx>=sm_80 or not:

sm_80 and above uses some native bfloat16 math instructions

below sm_80 always uses generic impls

So I set it to compile with sm_80 flag because this is what the CI has, and allows testing of the native impls.

It is possible to remove the arch flag, and it is probably the best thing to do now that bfloat16 is generically supported, to avoid confusion. Unfortunately this means that the native impls won't be tested automatically via the CI. But I suppose we could test this in release testing.

ext_oneapi_bfloat16_math_functions is really an artifact of earlier times when bfloat16 was not generically implemented for all devices: I think it should be removed.

Thanks

Given that, I think I'll have it test both and add a comment summarizing what you just said. Or, at minimum, the comment.

Add fno-fast-math to tests that rely on it in at least the cuda backend.

11cd5b6

Signed-off-by: JackAKirk <jack.kirk@codeplay.com>

JackAKirk requested a review from a team as a code owner June 14, 2023 21:38

JackAKirk requested a review from cperkinsintel June 14, 2023 21:38

Merge branch 'sycl' into no-fast-math-cuda

63929bf

JackAKirk temporarily deployed to aws June 15, 2023 10:27 — with GitHub Actions Inactive

JackAKirk temporarily deployed to aws June 15, 2023 11:06 — with GitHub Actions Inactive

JackAKirk changed the title ~~[SYCL][CUDA] Add fno-fast-math to tests that rely on it.~~ [SYCL][CUDA] Add no-fast-math to tests that rely on it. Jun 16, 2023

Added -fno-fast-math flag to newly failing affected test.

35b9d1f

Signed-off-by: JackAKirk <jack.kirk@codeplay.com>

JackAKirk temporarily deployed to aws June 27, 2023 10:46 — with GitHub Actions Inactive

JackAKirk temporarily deployed to aws June 27, 2023 11:20 — with GitHub Actions Inactive

Merge branch 'sycl' into no-fast-math-cuda

b18d4f6

JackAKirk temporarily deployed to aws July 10, 2023 09:22 — with GitHub Actions Inactive

JackAKirk temporarily deployed to aws July 10, 2023 09:57 — with GitHub Actions Inactive

aelovikov-intel approved these changes Jul 10, 2023

View reviewed changes

aelovikov-intel merged commit fde72c6 into intel:sycl Jul 10, 2023
10 of 12 checks passed

cperkinsintel reviewed Jan 3, 2024

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[SYCL][CUDA] Add no-fast-math to tests that rely on it. #9889

[SYCL][CUDA] Add no-fast-math to tests that rely on it. #9889

JackAKirk commented Jun 14, 2023

aelovikov-intel commented Jun 14, 2023

JackAKirk commented Jun 15, 2023

JackAKirk commented Jul 10, 2023

aelovikov-intel commented Jul 10, 2023

cperkinsintel Jan 3, 2024

JackAKirk Jan 3, 2024 •

edited

Loading

cperkinsintel Jan 3, 2024 •

edited

Loading

[SYCL][CUDA] Add no-fast-math to tests that rely on it. #9889

[SYCL][CUDA] Add no-fast-math to tests that rely on it. #9889

Conversation

JackAKirk commented Jun 14, 2023

aelovikov-intel commented Jun 14, 2023

JackAKirk commented Jun 15, 2023

JackAKirk commented Jul 10, 2023

aelovikov-intel commented Jul 10, 2023

cperkinsintel Jan 3, 2024

Choose a reason for hiding this comment

JackAKirk Jan 3, 2024 • edited Loading

Choose a reason for hiding this comment

cperkinsintel Jan 3, 2024 • edited Loading

Choose a reason for hiding this comment

JackAKirk Jan 3, 2024 •

edited

Loading

cperkinsintel Jan 3, 2024 •

edited

Loading