Adjust allclose atol for the flash attention TPU test #6889

JackCaoG · 2024-04-04T20:31:32Z

This should fix the TPU CI Failure, confirmed locally.

alanwaketan · 2024-04-04T20:33:15Z

test/test_pallas.py

@@ -205,7 +205,7 @@ def test_flash_attention_wrapper(self):

    o = flash_attention(q, k, v)
    expected_o = self._attention(q, k, v)
-    self.assertTrue(torch.allclose(o.cpu(), expected_o.cpu()))
+    self.assertTrue(torch.allclose(o.cpu(), expected_o.cpu(), atol=1e-07))


How about using 1e-4? Just in case.

let's do 1e-5 lol, we also set the precision to highest in the test.

alanwaketan

LGTM.

JackCaoG · 2024-04-04T21:18:59Z

TPU CI passed, I am just going to merge.

Adjust allclose atol for the flash attention TPU test

a673b2f

JackCaoG requested review from alanwaketan and lsy323 April 4, 2024 20:31

alanwaketan reviewed Apr 4, 2024

View reviewed changes

lint

d7896b1

alanwaketan approved these changes Apr 4, 2024

View reviewed changes

JackCaoG merged commit 73de972 into master Apr 4, 2024
3 of 5 checks passed

ysiraichi mentioned this pull request Apr 8, 2024

Failing Torchbench Models: tracking issue #5932

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Adjust allclose atol for the flash attention TPU test #6889

Adjust allclose atol for the flash attention TPU test #6889

JackCaoG commented Apr 4, 2024

alanwaketan Apr 4, 2024

JackCaoG Apr 4, 2024

alanwaketan left a comment

JackCaoG commented Apr 4, 2024

Adjust allclose atol for the flash attention TPU test #6889

Adjust allclose atol for the flash attention TPU test #6889

Conversation

JackCaoG commented Apr 4, 2024

alanwaketan Apr 4, 2024

Choose a reason for hiding this comment

JackCaoG Apr 4, 2024

Choose a reason for hiding this comment

alanwaketan left a comment

Choose a reason for hiding this comment

JackCaoG commented Apr 4, 2024