[Backport] Support XLA_USE_BF16 #6841

alanwaketan · 2024-03-27T21:56:04Z

Summary:
XLA_USE_BF16=1 will make all the internal xla tensors to use BF16 but torch.tensor wrappers will still return torch.float. To address this, we need to set the jax tracers correctly to produce the correct Mosaic.

Test Plan:
PJRT_DEVICE=TPU python test/test_pallas.py -v -k test_flash_attention_wrapper_bf16

Summary: XLA_USE_BF16=1 will make all the internal xla tensors to use BF16 but torch.tensor wrappers will still return torch.float. To address this, we need to set the jax tracers correctly to produce the correct Mosaic. Test Plan: PJRT_DEVICE=TPU python test/test_pallas.py -v -k test_flash_attention_wrapper_bf16 address comments

alanwaketan · 2024-03-28T18:35:59Z

Thanks Jack and Siyuan.

alanwaketan requested review from lsy323 and JackCaoG March 27, 2024 21:56

alanwaketan mentioned this pull request Mar 27, 2024

2.3 backport PR request list #6676

Closed

JackCaoG approved these changes Mar 27, 2024

View reviewed changes

lsy323 merged commit c6a8874 into r2.3 Mar 28, 2024
17 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Backport] Support XLA_USE_BF16 #6841

[Backport] Support XLA_USE_BF16 #6841

alanwaketan commented Mar 27, 2024

alanwaketan commented Mar 28, 2024

[Backport] Support XLA_USE_BF16 #6841

[Backport] Support XLA_USE_BF16 #6841

Conversation

alanwaketan commented Mar 27, 2024

alanwaketan commented Mar 28, 2024