[TIR] Refactor BF16Legalize #14405
Conversation
Thanks for contributing to TVM! Please refer to the contributing guidelines https://tvm.apache.org/docs/contribute/ for useful information and tips. Please request code reviews from Reviewers by @-ing them in a comment.
Generated by tvm-bot

Force-pushed from 5d9440a to d2011c4
This PR refactors BF16Legalize to enable more f32 computations. We also split BF16Legalize into two steps:

- BF16ComputeLegalize changes all computation to f32 while keeping the external BF16 storage.
- BF16StorageLegalize changes all storage to u16.

Now BF16 kernels accept tvm.nd.array inputs that are created with the bfloat16 type.
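For context, the two legalization steps rest on the fact that bfloat16 is the upper 16 bits of an IEEE float32. Below is a minimal numpy sketch (not TVM's actual implementation; the function names are illustrative) of what "storage as u16" and "compute as f32" mean at the bit level:

```python
import numpy as np

def bf16_to_f32(u16: np.ndarray) -> np.ndarray:
    """Widen uint16 bf16 storage to float32 by shifting into the high bits."""
    return (u16.astype(np.uint32) << 16).view(np.float32)

def f32_to_bf16(f32: np.ndarray) -> np.ndarray:
    """Truncate float32 to bf16 storage with round-to-nearest-even."""
    bits = f32.view(np.uint32)
    rounding_bias = ((bits >> 16) & 1) + 0x7FFF
    return ((bits + rounding_bias) >> 16).astype(np.uint16)

# Values exactly representable in bf16 round-trip losslessly.
x = np.float32([1.0, -2.5, 3.0])
assert np.array_equal(bf16_to_f32(f32_to_bf16(x)), x)
```

A legalized kernel would load the u16 storage, widen with something like `bf16_to_f32`, do all arithmetic in f32, then narrow the result back before storing.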
Note: the Android failure is not related to this PR.
cc @vinx13

Retriggering failed Android tests
// TODO(tvm-team): consider adding native support
ICHECK(!from.is_bfloat16()) << "BF16 needs to be storage lowered first";
ICHECK(!to.is_bfloat16()) << "BF16 needs to be storage lowered first";
It seems possible to support software BF16 in the upstream LLVM/MLIR world; leaving it to future work.
Good point; already left as a TODO.
# Handle bfloat16: we explicitly allocate
# bfloat16 arrays as input
for i, param in enumerate(mod["main"].params):
    if param.type_annotation.dtype == "bfloat16":
        input_data[i] = tvm.nd.empty(input_data[i].shape, "bfloat16").copyfrom(
            input_data[i]
        )
Why are we adding ONNX tests in this PR, though?
This is needed to patch up ONNX converting bf16 to uint16.